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Abstract 

I review, on an advanced level, some of the algebraic and geometric structures 
that underlie the theory of Special Relativity. This includes a discussion of rela- 
tivity as a symmetry principle, derivations of the Lorentz group, its composition 
law, its Lie algebra, comparison with the Galilei group, Einstein synchroniza- 
tion, the lattice of causally and chronologically complete regions in Minkowski 
space, rigid motion (the Noether-Herglotz theorem), and the geometry of rotating 
reference frames. Representation-theoretic aspects of the Lorentz group are not 
included. A series of appendices present some related mathematical material. 

This paper is a contribution to the proceedings of the 339th WE-Heraeus- 
Seminar Special Relativity: Will it Survive the Next 100 Years?, edited by 
J. Ehlers and C. Lammerzahl, which will be published by Springer Verlag in 
2006. 
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1 Introduction 



In this contribution I wish to discuss some structural aspects of Special Relativity 
(henceforth abbreviated SR) which are, technically speaking, of a more advanced na- 
ture. Most of what follows is well known, though generally not included in standard 
text-book presentations. Against my original intention, I decided to not include those 
parts that relate to the representation- and field-theoretic aspects of the homogeneous 
and inhomogeneous Lorentz group, but rather to be more explicit on those topics now 
covered. Some of the abandoned material is (rather informally) discussed in l27l . 
All of it will appear in l22l . For a comprehensive discussion of many field-theoretic 
aspects, see e.g. l42ll . 

I always felt that Special Relativity deserves more attention than what is usually 
granted to it in courses on mechanics or electrodynamics. There is a fair amount of 
interesting algebraic structure that characterizes the transition between the Gahlei and 
Lorentz group, and likewise there is some interesting geometry involved in the tran- 
sition between Newtonian (or Galilean) spacetime and Minkowski space. The latter 
has a rich geometric structure, notwithstanding the fact that, from a general relativis- 
tic viewpoint, it is 'just' fiat spacetime. I hope that my contribution will substantiate 
these claims. For the convenience of some interested readers I have included several 
mathematical appendices with background material that, according to my experience, 
is considered helpful being spelled out in some detail. 

2 Some remarks on 'symmetry' and 'co variance' 

For the purpose of this presentation I regard SR as the (mathematical) theory of how to 
correctly implement the Galilean Relativity Principle — henceforth simply abbreviated 
by RP. The RP is a physical statement concerning a subclass of phenomena — those not 
involving gravity — which translates into a mathematical symmetry requirement for the 
laws describing them. But there is no unique way to proceed; several choices need to 
be made whose correctness cannot be decided by mere logic. 

Given that the symmetry requirement is implemented by a group action (which 
may be relaxed; compare e.g. supersymmetry, which is not based on a group), the most 
fundamental question is: what group? In this regard there is quite a convincing string 
of arguments that, given certain mild technical assumptions, the RP selects either the 
Galilei or the Lorentz group (the latter for some yet undetermined velocity parameter 
c). This will be discussed in detail in Section|3] 

Almost as important as the selection of a group is the question of how it should act 
on physical entities in question, like particles and fields. The importance and subtlety 
of this question is usually underestimated. Let us therefore dwell a little on it. 

As an example we consider vacuum electrodynamics. Here the mathematical ob- 
jects that represent physical reality are two spacetime dependent fields, E(x, t] and 
B(x, t), which take values in a vector space isomorphic to M^. There will be certain 
technical requirements on these fields, e.g. concerning differentiability and fall-off at 
spatial infinity, which we do not need to spell out here. For simplicity we shall assume 
that the set of all fields obeying these conditions forms an infinite-dimensional linear 
space /C, which is sometimes called the space of 'kinematical' (or 'kinematically pos- 
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sible') fields. Those fields in /C which satisfy Maxwell's (vacuum) equations form a 
proper subset, 5 C /C, which, due to the linearity of the equations and the boundary 
conditions (fall-off to zero value, say), is a linear subspace. It is called the space of 
'physical' (or 'dynamically possible') fields. Clearly these notions of the spaces of 
kinematically and dynamically possible fields apply to all sorts of situations in physics 
where one considers 'equations of motion' , though in general neither of these sets will 
be a vector space. This terminology was introduced in fl]. 

In general, we say that a group G is a symmetry group of a given dynamical theory 
if the following two conditions are satisfied: 

1. There exists an (say left-) effective action G x /C— )/C, (g,k) i— ) g-k, ofGon 
/C (cf. Sect. lA.lt . Posing effectiveness just means that we do not wish to allow 
trivial enlargements of the group by elements that do not move anything. It also 
means no loss of generality, since any action of a group G on a set /C factors 
through an effective action of G/G ' on /C, where G ' is the normal subgroup 
of trivially acting elements (cf. Sect. lA.ll) . Such an action of the group on the 
kinematical space of physical fields is also called an implementation of the group 
into the physical theory. 

2. The action of G on /C leaves S invariant (as a set, not necessarily pointwise), 
i.e. if s E 5 then g • s G 5 for all g. This merely says that the group action 
restricts from /C to S. Note that, from an abstract point of view, this is the 
precise statement of the phrase 'leaving the field equations invariant', since the 
field equations are nothing but a characterization of the subset S <Z JC. 

If this were all there is to require for a group to count as a symmetry group, then we 
would probably be surprised by the wealth of symmetries in Nature. For example, in 
the specific case at hand, vacuum electrodynamics, we often hear or read the statement 
that the Lorentz group leaves Maxwell's equations invariant, whereas the Galilei group 
does not. Is this really true? Has anyone really shown in this context that the Galilei 
group cannot effectively act on K, so as to leave S invariant? Certainly not, because 
such an action is actually known to exist; see e.g. Chap. 5.9 in |21|. Hence, in the 
general sense above, the Galilei group is a symmetry group of Maxwell's equations! 

The folklore statement just alluded to can, however, be turned into a true statement 
if a decisive restriction for the action is added, namely that it be local. This means 
that the action on the space of fields is such that the value of the transformed field at 
the transformed spacetime point depends only on the value of the untransformed field 
at the untransformed point and not, in addition, on its derivatives.^ This is the crucial 
assumption that is implicit in all proofs of Galilean-non-invariance of Maxwell's equa- 
tions and that is also made regarding the Lorentz group in classical and quantum field 
theories. The action of the Galilei group that makes it a symmetry group for Maxwell's 
equations is, in fact, such that the transformed field depends linearly on the original 
field and its derivatives to all orders. That is, it is highly non local. 

Returning to the general discussion, we now consider a classical field, that is, a 
map ^ : M — ) V from spacetime M into a vector space V. A spacetime symmetry- 
group has an action on M, denoted by T : (g,x) i— > Tg(x), as well as an action on 

' Here one should actually distinguish between 'ultralocality', meaning not involving any derivatives, 
and 'locality', meaning just depending on derivatives of at most finite order. 
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V, which in most cases of interest is a linear representation g i— > D(g) of G. A local 
action of G on field space is then given by 

(g,^)^ g-V:=D(g]o^oTg-\ (1) 

where here and below the symbol o denotes composition of maps. This is the form 
of the action one usually assumes. Existing generalizations concerning possible non- 
linear target spaces for W and/or making W a section in a bundle, rather than just a 
global function, do not influence the locality aspect emphasized here and will be ig- 
nored. 

Next to fields one also considers particles, at least in the classical theory. Struc- 
tureless (e.g. no spin) particles in spacetime are mathematically idealized by maps 
y : M — > M, where E (or a subinterval thereof) represents parameter space. The 
parameterization usually does not matter, except for time orientation, so any reparam- 
eterization f : M — ) M with f > gives a reparameterized curve y' := y o f which is 
just as good. On the space of particles, the group G acts as follows: 

(g,y) ^ g -y := Tg oy . (2) 

Together (Q and ^ define an action on all the dynamical entities, that is particles and 
fields, which we collectively denote by the symbol O. The given action of G on that 
space is simply denoted by (g, O) i— > g • O. 

Now, the set of equations of motion for the whole system can be written in the 
general form 

^:[i,o] = o, (3) 

where this should be read as a multi-component equation (with being the zero 'vec- 
tor' in target space). L stands collectively for non-dynamical entities (background 
structures) whose values are fixed by means independent of equation It could, for 
example, be the Minkowski metric in Maxwell's equations and also external currents. 
The meaning of Q is to determine O, given L (and the boundary conditions for O). 
We stress that Z is a constitutive part of the equations of motion. We now make the 
following 

Definition 1. An action of the group G on the space of dynamical entities O is said to 
correspond to a symmetry of the equations of motion iff^ for all g G G we have 

£[L,O]=0 ^ £[L,g-O]=0. (4) 

Different form that is mere 'covariance', which is a far more trivial requirement. 
It arises if the space of background structures, L, also carries an action of G (as it 
naturally does if the L are tensor fields). Then we have 

Definition 2. An action of the group G on the space of dynamical and non-dynamical 
entities O and L is said to correspond to a covariance of the equations of motion iff 
for all g G G we have 

£[L,(D]=0 ^ £[g-L,g-(^]=0. (5) 

^ Throughout we write 'iff as abbreviation for 'if and only if. 
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The difference to symmetries is that the background structures — and in that sense 
the equations of motion themselves — are changed too. Equation Q says that if O 
solves the equations of motion then g • O solves the very same equations. In contrast, 
^ merely tells us that if O solves the equations of motion then g • O solves the 
appropriately transformed equations. Trivially, a symmetry is also a covariance but the 
converse it not true. Rather, a covariance is a symmetry iff it stabilizes the background 
structures, i.e. if for all g G G we have g • Z = Z. 

Usually one has a good idea of what the dynamical entities (]? in ones theory should 
be, whereas the choice of Z is more a matter of presentation and therefore conventional. 
After all, the only task of equations of motion is to characterize the set S of dynam- 
ically possible fields (and particles) amongst the set K, of all kinematically possible 
ones. Whether this is done by using auxiliary structures or Z2 should not affect 
the physics. It is for this reason that one has to regard the requirement of mere covari- 
ance as, physically speaking, rather empty. This is because one can always achieve 
covariance by suitably adding non-dynamical structures Z. Let us give an example for 
this ||2J. 

Consider the familiar heat equation. 



Here the dynamical field O = T is the temperature function. The background structure 
is the 3-dimensional Euclidean metric, 5, of space which enters the Laplacian, A := 
5'^^9a9b; hence Z = 5. This equation possesses time translations and Euclidean 
motions in space (we neglect space reflections for simplicity) as symmetries. These 
form the group E3 = x: S0(3), the semi-direct product of spatial translations and 
rotations. Clearly E3 stabilizes 6. 

But without changing the physics we can rewrite ^ in the following spacetime 
form: Let (x°,x\x^,x^) = (ct,x,y,z) be inertial coordinates in Minkowski space 
and TL = 9t = ti*^9^ (i.e. n*^ = (c, 0,0,0)) the (covariant) constant vector field 
describing the motion of the inertial observer. The components of the Minkowski 
metric in these coordinates are denoted by g^-,,. In our conventions ('mostly minus') 
{g^-v} = diag(l,-l,-l,-l). Then ^ is clearly just the same as 



Here the dynamical variable is still O = T but the background variables are now given 
by Z = (n, g). This equations is now manifestly covariant under the Lorentz group if 
n^^ and g ^^"^ are acted upon as indicated by their indices. Hence we were able to enlarge 
the covariance group by enlarging the space of Zs. In fact, we could even make the 
equation covariant under general diffeomorphisms by replacing partial with covariant 
derivatives. But note that the symmetry group would still be that subgroup that stabi- 
lizes (leaves invariant) the (flat) metric g and the (covariant constant) vector field u, 
which again results in the same symmetry group as for the original equation For 
more discussion concerning also the problematic aspects of of the notion of 'general 
covariance', see e.g. 1231 . 



9tT - kAT = . 



(6) 




g^^) 9^9^7 = 0. 



(7) 
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3 The impact of the relativity principle on the 
automorphism group of spacetime 



In the history of SR it has often been asked what the most general transformations of 
spacetime were that implemented the relativity principle (RP), without making use of 
the requirement of the constancy of the speed of light. This question was first addressed 
by Ignatowsky l3Tl . who showed that under a certain set of technical assumptions 
(not consistently spelled out by him) the RP alone suffices to arrive at a spacetime 
symmetry group which is either the Galilei or the Lorentz group, the latter for some 
yet undetermined limiting velocity c. More precisely, what is actually shown in this 
fashion is, as we will see, that the spacetime symmetry group must contain either the 
proper orthochronous Galilei or Lorentz group, if the group is required to comprise 
at least spacetime translations, spatial rotations, and boosts (velocity transformations). 
What we hence gain is the group-theoretic insight of how these transformations must 
combine into a common group, given that they form a group at all. We do not learn 
anything about other transformations, like spacetime reflections or dilations, whose 
existence we neither required nor ruled out on this theoretical level. 

The work of Ignatowsky was put into a logically more coherent form by Franck & 
Rothe II91I20I . who showed that some of the technical assumptions could be dropped. 
Further formal simplifications were achieved by Berzi & Gorini |8|. Below we shall 
basically follow their line of reasoning, except that we do not impose the continuity 
of the transformations as a requirement, but conclude it from their preservation of the 
inertial structure plus bijectivity. See also |l3l for an alternative discussion on the level 
of Lie algebras. 

The principles of SR are mathematically most concisely expressed in terms of few 
simple structures put onto spacetime. In SR these structures aie absolute in the sense 
of not being subject to any dynamical change. From a fundamental point of view, 
it seems rather a matter of convention whether one thinks of these structures as pri- 
marily algebraic or geometric. According to the idea advocated by Felix Klein in his 
'Erlanger Programm' ll36l . a geometric structure can be characterized by its automor- 
phism group^. The latter is generally defined by the subgroup of bijections of the set 
in question which leaves the geometric structure — e.g. thought of as being given in 
terms of relations — invariant. Conversely, any transformation group (i.e. subgroup of 
group of bijections) can be considered as the automorphism group of some 'geometry' 
which is defined via the invariant relations. 

The geometric structure of spacetime is not a priori given to us. It depends on the 
physical means on which we agree to measure spatial distances and time durations. 
These means refer to physical systems, like 'rods' and 'clocks', which are themselves 
subject to dynamical laws in spacetime. For example, at a fundamental physical level, 
the spatial transportation of a rod or a clock from one place to another is certainly a 
complicated dynamical process. It is only due to the special definition of 'rod' and 
'clock' that the result of such a process can be summarized by simple kinematical 
rules. Most importantly, their dynamical behavior must be 'stable' in the sense of being 
essentially independent of their dynamical environment. Hence there is always an 
implicit consistency hypothesis underlying operational definitions of spatio-temporal 

Klein calls it 'Hauptgmppe'. 
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measurements, which in case of SR amount to the assumption that rods and clocks are 
themselves governed by Lorentz invariant dynamical laws. 

A basic physical law is the law of inertia. It states the preference of certain types of 
motions for force-free, uncharged, zero-spin test-particles: the 'uniform' and 'rectilin- 
ear' ones. In the spacetime picture this corresponds to the preference of certain curves 
corresponding to the inertial worldlines of the force-free test particles. In the gravity- 
free case, we model these world lines by straight lines of the affine space Aff (M^) over 
the vector space M'*. This closely corresponds to our intuitive notion of homogeneity 
of space and time, that is, that there exists an effective and transitive (and hence simply 
transitive) action of the Abelian group M"* of translations (cf. Sects. lA. fl and lA.Vl i. A lot 
could (and perhaps should) be said at this point about the proper statement of the law 
of inertia and precisely how it endows spacetimes with certain geometric structures. 
Instead we will simply refer the interested reader to the hterature; see e.g. I.26J and 
references therein. 

Note that we do not conversely assume any straight line to correspond to some 
inertial world-line. Hence the first geometric structure on spacetime, which can be 
thought of as imposed by the law of inertia, is that of a subset of straight lines. If all 
straight lines were involved, the automorphism group of spacetime would necessarily 
have to map any straight line to a straight line and therefore be a subgroup of the affine 
group M"* XI GL(4, M). This is just the content of the main theorem in affine geometry; 
see e.g. However, we can only argue that it must map the subset of inertial world- 
lines onto itself. We take this subset to consist of all straight lines in Aff (M'*) whose 
slope with respect to some reference direction is smaller than a certain finite value (3. 
This corresponds to all worldlines not exceeding a certain limiting speed with reference 
to some inertial frame. It is then still true that any bijection^ of Aff (M^) 

preservmg 

that subset must be a subgroup of the affine group l29l . Also, it is not necessary to 
assume that lines map surjectively onto lines 1 16 1. 

For further determination of the automorphism group of spacetime we invoke the 
following principles: 

STl: Homogeneity of spacetime. 

ST2: Isotropy of space. 

ST3: Galilean principle of relativity. 

We take STl to mean that the sought-for group should include all translations and 
hence be of the form Mf^ xi G, where G is a subgroup of GL(4, M). ST2 is interpreted 
as saying that G should include the set of all spatial rotations. If, with respect to some 
frame, we write the general element A G GL(4,M) in a 1 + 3 split form (thinking of 

If one drops the assumption of bijectivity, then there exist in addition the fractional linear transforma- 
tions which map straight lines to straight lines, except for those points that are mapped to 'infinity' ; see 
e.g. the discussion in Fock's book 1 18|, in particular his Appendix A, and also |20|. One might argue 
that since physics takes place in the finite we cannot sensibly argue for global bijectivity and hence 
have to consider those more general transformations. However, the group they generate does not have 
an invariant bounded domain in spacetime and hence cannot be considered as the automorphism group 
of any fixed set of physical events. 
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the first coordinate as time, the other tliree as space), we want G to include all 



R(D) 




) 



where D G S0(3) . 



(8) 



Finally, ST3 says that velocity transformations, henceforth called 'boosts', are also 
contained in G. However, at this stage we do not know how boosts are to be repre- 
sented mathematically. Let us make the following assumptions: 

Bl: Boosts B(v) are labeled by a vector v G Bc(M^), where Bc(M^) is the open 
ball in of radius c. The physical interpretation of v shall be that of the boost 
velocity, as measured in the system from which the transformation is carried out. 
We allow c to be finite or infinite (B(X)(M^) = R^). v = corresponds to the 
identity transformation, i.e. B(0) = idig4. We also assume that v, considered as 
coordinate function on the group, is continuous. 

B2: As part of ST2 we require equivariance of boosts under rotations: 



The latter assumption allows us to restrict attention to boost in a fixed direction, say 
that of the positive x-axis. Once their analytical form is determined as function of 
V, where v = vCx, we deduce the general expression for boosts using ^ and (ISj. 
We make no assumptions involving space reflections.^ We now restrict attention to 
V = vCx. We wish to determine the most general form of B (v) compatible with all 
requirements put so far. We proceed in several steps: 

1. Using an arbitrary rotation D around the x-axis, so that D • v = v , equation Q 
allows to prove that 



where here we wrote the 4x4 matrix in a 2 + 2 decomposed form. (i.e. A(v) 
is a 2 X 2 matrix and I2 is the 2x2 unit-matrix). Applying ^ once more, this 
time using a 7t-rotation about the y-axis, we learn that a is an even function, i.e. 



Below we will see that a(v) = 1 . 

Some derivations in the literature of the Lorentz group do not state the equivariance property J9j explic- 
itly, though they all use it (implicitly), usually in statements to the effect that it is sufficient to consider 
boosts in one fixed direction. Once this restriction is effected, a one-dimensional spatial reflection 
transformation is considered to relate a boost transformation to that with opposite velocity. This then 
gives the impression that reflection equivariance is also invoked, though this is not necessary in space- 
time dimensions greater than two, for j9j allows to invert one axis through a 180-degree rotation about 
a perpendicular one. 



R(D) • B(v) • R(D"^) = B(D -v) . 



(9) 




(10) 



a(v) = a(— v) . 



(11) 
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2. Let us now focus on A(v], which defines the action of the boost in the t — x 
plane. We write 

(:)-(::) -<^'-(:)Kc!r.i:l)-(:)- - 

We refer to the system with coordinates (t,x) as K and that with coordinates 
(t',x') as K'. From (fT2b and the inverse (which is elementary to compute) one 
infers that the velocity v of K' with respect to K and the velocity v' of K with 
respect to K' are given by 

V = - c(v)/d(v) , (13a) 
v' = - vd(v)/a(v) =: (p(v) . (13b) 

Since the transformation K' — > K is the inverse of K — > K', the function cp : 
(— c,c) — > (— c,c) obeys 

A((p(v)) = [AM)-' . (14) 
Hence cp is a bijection of the open interval (— c, c) onto itself and obeys 

cp o cp = id(_c,c) • (15) 

3. Next we determine cp. Once more using Q, where D is a 7t-rotation about the 
y-axis, shows that the functions a and d in dlOt are even and the functions b 
and c are odd. The definition (I13bl) of (p then implies that cp is odd. Since we 
assumed v to be a continuous coordinatization of a topological group, the map 
cp must also be continuous (since the inversion map, g h-) g^^, is continuous in 
a topological group). A standard theorem now states that a continuous bijection 
of an interval of M onto itself must be strictly monotonic. Together with (fTSb 
this implies that cp is either the identity or minus the identity map.^ If it is 
the identity map, evaluation of ( fT4b shows that either the determinant of A(v) 
must equals —1, or that A(v) is the identity for all v. We exclude the second 
possibility straightaway and the first one on the grounds that we required A(v) 
be the identity for v = 0. Also, in that case, (fT4li implies A^{v) = id for all 
V G (— c, c). We conclude that cp = —id, which implies that the relative velocity 
of K with respect to K' is minus the relative velocity of K' with respect to K. 
Plausible as it might seem, there is no a priori reason why this should be so.'. 
On the face of it, the RP only implies (fT5l . not the stronger relation cp(v) = —v. 
This was first pointed out in [ SJ- 

4. We briefly revisit (fTTl . Since we have seen that B(— ve^) is the inverse of 
B(vex), we must have a(— v) = 1 /a(v), so that (fTTt implies a(v) = ±1. But 
only a(v) = +1 is compatible with our requirement that B(0) be the identity. 

^ The simple proof is as follows, where we write v' := cp{v) to save notation, so that J15> now reads 
v" = V. First assume that cp is strictly monotonically increasing, then v' > v implies v = v" > v', 
a contradiction, and v' < v implies v = v" < v', likewise a contradiction. Hence tp = id in this 
case. Next assume cp is strictly monotonically decreasing. Then cp := — (p is a strictly monotonically 
increasing map of the interval (— c,c) to itself that obeys jl5> . Hence, as just seen, cp = id, i.e. 
cp = —id. 

' Note that v and v' are measured with different sets of rods and clocks. 
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5. Now we return to the determination of A(v). Using (flSl and cp = 



—id, we write 



A(v) = ( ^(^.^ , ) (16) 

\-va[v) a[v]J 

and 

A(v) := det(A(v)) = a(v) [a(v) + vb(v)] . (17) 
Equation A(— v) = (A(v))^^ is now equivalent to 

a(-v) = a[v)/A[v) , (18a) 
b(-v) = -b(v)/A(v) . (18b) 

Since, as already seen, a is an even and b is an odd function, (fTSb is equivalent to 
A(v) = 1, i.e. the unimodularity of B(v). Equation (flTl then allows to express 
b in terms of a: 



b(v) = 



(19) 



6. Our problem is now reduced to the determination of the single function a. This 
we achieve by employing the requirement that the composition of two boosts in 
the same direction results again in a boost in that direction, i.e. 

A(v) •A(v') =A(v"). (20) 

According to (fT6b each matrix A(v) has equal diagonal entries. Applied to the 
product matrix on the left hand side of d20t this implies that v^^(a^^(v) — 1 ] 
is independent of v, i.e. equal to some constant k whose physical dimension is 
that of an inverse velocity squared. Hence we have 

a(v) = J—^ , (21) 

where we have chosen the positive square root since we require a(0) = 1 . The 
other implications of (l20t are 

a(v)a(v')(l -kw') = a(v") , (22a) 
a(v)a(v')(l +vv') =v"a[v"), (22b) 

from which we deduce 

Conversely, (|2lJ and d23b imply (l22l . We conclude that d20l is equivalent to 
(EB and (El. 

7. So far a boost in x direction has been shown to act non-trivially only in the t — x 
plane, where its action is given by the matrix that results from inserting (IT9l and 
dlTT) into (fT6l : 

A(v] = f '^(^.^ ^^'^S^^') where a(v) = l/TTTl^. (24) 
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• If k > we rescale t h-> t := t/ \/k and set \/kv := tan a. Then (l24t is 
seen to be a Euchdean rotation with angle a in the t — x plane. The veloc- 
ity spectrum is the whole real line plus infinity, i.e. a circle, corresponding 
to a G [0,27t], where and In are identified. Accordingly, the composi- 
tion law d23l is just ordinary addition for the angle a. This causes several 
paradoxa when v is interpreted as velocity. For example, composing two 
finite velocities v,v' which satisfy vv' = 1 /k results in v" = oo, and 
composing two finite and positive velocities, each of which is greater than 
1 / \/k, results in a finite but negative velocity. In this way the successive 
composition of finite positive velocities could also result in zero velocity. 
The group G C GL(n, R) obtained in this fashion is, in fact, S0(4). This 
group may be uniquely characterized as the largest connected group of bi- 
jections of M'* that preserves the Euclidean distance measure. In particular, 
it treats time symmetrically with all space directions, so that no invariant 
notion of time-orientability can be given in this case. 

• For k = the transformations are just the ordinary boosts of the Galilei 
group. The velocity spectrum is the whole real line (i.e. v is unbounded 
but finite) and G is the Galilei group. The law for composing velocities is 
just ordinary vector addition. 

• Finally, for k < 0, one infers from (l23l that c := 1 / \/— k is an upper bound 
for all velocities, in the sense that composing two velocities taken from 
the interval (— c, c) always results in a velocity from within that interval. 
Writing t := ct, v/c =: (3 =: tanh p, and y = 1 / \/l — (3^, the matrix 
(l24l) is seen to be a Lorentz boost or hyperbolic motion in the t — x plane: 

('^\^( ^ -|3y\ M ^ / coshp -sinhp\ /t\ 
Xxj (3y y y \xy sinhp cosh p j \xy ' 

The quantity 

p := tanh"^ (v/c) = tanh"^ ( (3 ) (26) 

is called rapidity^ . If rewritten in terms of the corresponding rapidities the 
composition law d23l reduces to ordinary addition: p" = p + p'. 

This shows that only the Galilei and the Lorentz group survive as candidates for 
any symmetry group implementing the RP. Once the Lorentz group for velocity pa- 
rameter c is chosen, one may prove that it is fully characterized by its property to leave 
a certain symmetric bilinear form invariant (cf. Sect lA.4t . Endowing spacetime with 
that structure plus the affine structure from the law of inertia, we can characterize the 
Lorentz group as automorphism group of some geometric structure. This is often the 
starting point of more axiomatic approaches. Here we preferred to start with the op- 
posite strategy, which stresses that the geometry of spacetime is a contingent physical 
property, emerging through its automorphism group, which in turn relates to the ac- 
tual dynamical laws of nature. Having said that, we may now follow the convenient 
axiomatic line of presentation. 



* This term was coined by Robb 1411 . but the quantity was used before by others; compare 1521 . 
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4 Algebraic structures of Minkowski space 



Definition 3. Minkowski space is the affine space Aff (M'^) over the four-dimensional 
real vector space M"*, where the latter is endowed with a symmetric non-degenerate 
bilinear form g of signature (+,-,-,-) = (1,3). We write MI"* = (AfflM^), g). 
We shall usually restrict to bases {e|^}^=o■■■3 of M"* for which g(e^, e^) =: g^-v = 
diag(1, -1,-1,-1). 

4.1 The Lorentz and the GaUlei group 

Definition 4. The (homogeneous) Lorentz group is the linear group (subgroup of 
GL(4,M)) of orthogonal transformations of M"*, also called 0(1,3). Hence G 
0(1,3) iff 

g^,L^jL^ = gap. (27) 

Note that according to Propositionl20l orthogonal transformations are necessarily 
linear. 

As topological space 0(1 ,3) decomposes into the disjoint union of four connected 
components. Here +/— stands for positive/negative determinant and t / i for time- 
orientation preserving/reversing respectively: 

0(1,3) =0^(1,3) U 0^(1,3) U0L(1,3) U 0^(1,3). (28) 

S0(1,3) 

Of these four components only 0^(1,3), the component containing the group iden- 
tity, is a subgroup, called the group of proper orthochronous Lorentz transformations. 

T I 

Elementwise composition with space/time reflections gives 0L(1 ,3)/0_(1 ,3) respec- 
tively. In the sequel we shall also write Lor for 0(1 ,3) and Lor^ for 0^(1 ,3). 

For any group G C GL(n, M), there is a corresponding inhomogeneous group, IG, 
given by the semi-direct product 

IG ={(a,A) I aeM^, AG G], (29) 

where 

(a,A)(a',A') = (a + A-a', A-A'). (30) 
It can again be thought of as subgroup of GL(n + 1 , M) via the embedding 

(a,A)^(y;). (31) 

In this fashion we get the inhomogeneous Lorentz groups ILor and ILor!,. also called 
Poincare groups. 

Let us recall the structure of the proper orthochronous homogeneous Galilei group, 
which we denote by Gal^. It is generated by spatial rotations x i— > x' = D • x and 
boosts X I— > x' = X + vt (t' = t in both cases). Hence, if we agree to let rotations act 
first and then act with the boosts, the general form of a matrix in Gal]|_ C GL(4,M) 
will be (written in a 1 + 3 decomposition): 
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Hence, given any pair (v,D], this tells us how to uniquely construct the matrix 
G(v, D] G Gal^. Conversely, given a matrix G G Gal^, we can immediately tell 
V G and D G SO (3] by comparison with the general form (l32l . Hence there is 
a bijection of sets G : x SO (3) — ) Gal^. The group structure on x SO that 
makes this into an isomorphism of groups is a semi-direct product: 

G(vi,Di) • G(V2,D2) = G(vi +Di •V2, Di - Da) . (33) 

Hence we have an isomorphism Gal^ = x S0(3). This also follows straightaway 
from comparing (I31t with (l32l . From (I33t the law for taking the inverse is easily 
deduced: 

{G[v,D))-' = G(~D-' ■v,D-') (34) 

The inhomogeneous Galilei group is now isomorphic to an iterated semi direct 
product: 

IGal^ := Qa\\ = e4 X (E^ X S0(3)) , (35) 

where corresponds to space-time translations and to boost. The action of Ga.\\. 
on R.^ is via the 'defining representation', i.e. the obvious action of 4 x 4 matrices of 
the form (l32l on Mf^. Not that this 4-dimensional representation of Gal^ is reducible: 
it transforms the 3-dimensional subspace of 'spatial' vectors (0, a)^ into themselves. 
Hence the semi-direct product of Gal!,_ with the subgroup of pure spatial translations, 
isomorphic to M^, is a proper subgroup of IGal!|_ that properly contains Gal!)_: Gal^ C 
X Gal!|_ C IGal^. In other words: Gal^ is not a maximal^ subgroup of IGall^.. 
Hence another way to write IGal^ as semi-direct product is 

IGal^ = (M^xM3) X (MxS0(3)). (36) 

where the first two on the right hand side correspond to spatial translations and 
boosts respectively, and the single M to time translations. The action of M x S0(3) on 

X is the factor-wise standard action of S0(3) on and the trivial action of M. 

At this point we can already anticipate some major group-theoretic differences 
between the Galilei and the Lorentz groups (denoted by Lor). For example: 

1. Lor!|_ is a simple group, that is, it does not contain any normal subgroup other 
than the trivial ones (itself and the unit element). The set of pure boost does 
not form a subgroup. In contrast, Gal^ is not even semi-simple, meaning that it 
contains a non-trivial Abelian normal subgroup, namely the boosts. This makes 
a big difference in the corresponding representation theories. 

2. In ILor^ = 1^4 ^ |_q,.T^ action of Lor^ on M"* is irreducible and Lor\_ is 
a maximal subgroup of ILor^, in contrast to the Galilean case. This makes 
a difference for the existence of invariant equivalence relations on spacetime 
(cf. Sect. lA.ll i. like, for example, absolute simultaneity structures. This will be 
further discussed in Sect. l5.3l 

' A proper subgroup G ' C G is maximal if there is no subgroup H of G such that G ' C H C G . 
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4.2 Polar decomposition 

In d32l we have given an easy proof-by-inspection of the unique decomposability of 
any element in Gal^ into a product of a rotation and a boost. We now like to discuss 
the analog of this decomposition within Lor^, which is more difficult to obtain. We 
start by recalling the statement and proof of the 'polar decomposition' of matrices: 

Proposition 1. Let X e GL(n,C); then there exists a unique R E U(rL) (i.e. R^^ = R^^ j 

and a unique positive-definite Hermitian matrix V> (i.e. B = B^^ with strictly positive 
eigenvalues) such that 

X = B • R . (37) 

IfX(^ GL(rL,M) then B is real, symmetric, and positive definite. R is real and orthog- 
onal. 

Proof Let A := XX^ which is positive-definite and Hermitean (zero eigenvalues are 
excluded since X is invertible). Recall that the square-root is a well defined bijective 
map (a homeomorphism in fact) of the space of positive-definite Hermitean matrices 
onto itself. Define B := ^/A and R := B^^X, then Rt = X+B^^ = X^^B = R-\ 
where the first equality follows from Hermiticity of B and the second from B^ = XX^ 
Hence R is unitary and we have shown existence of a polar decomposition. To show 
uniqueness, assume there exist two such decompositions: X = BiRi = B2R2- Then 
Bi = B2R3, where R3 := RiR^^ is again unitary. Hermiticity of Bi _2 and unitarity of 
R3 now imply B^ = BiB^| = B2R3R3B2 = B2 and hence Bi = B2, since 'squaring' 
is an injective map (a homeomorphism in fact) from the space of positive-definite 
Hermitean matrices onto itself. This, in turn, implies Ri = R2 and hence uniqueness. 
Finally, if X is real, then B and consequently R are also real. □ 

We wish to apply this to Lor^ C GL(4,M). But note that polar decomposing an 
element in G C GL(n,C) need not generally lead to factors in G. However, this is 
true in many cases. For example, we have 

Proposition 2. Let E'^''''' be the diagonal matrix whose first p diagonal entries equal 
+1 and the remaining q = n — p diagonal entries equal We define the group 

U(p,q) :={X G GL(n,C) | X • Et^'"' • X^ = E'^''''}. (38) 

Restricting to matrices with real entries gives the group 0(p,q). Polar decomposing 
elements o/U(p,q] or 0(p,q) leads to factors within these groups respectively. The 
same is true if we restrict to the identity components of these groups. 

Proof. It is sufficient to prove that B = VxxJ is in U(p,q) since this clearly im- 
plies that the product R = B^^X will also be in U(p,q). Now, E'^'*^' G U(p,q) (it 
clearly satisfies the defining relation in (I38t ) so that X^ and hence XX^ ai^e elements 
in U(p,q). But then VxxJ G U(p,q), too. To see this, use e.g. the exponential 
map (cf. Sect. lA.lOl . which defines a homeomorphism from the space of Hermitean 
to the space of positive-definite Hermitean matrices. Then X = exp(Y) G U(p,q] 
£(p,q) . Y . = ^ 0( = exp(Y/2) G U(p,q). Finally it is clear from the 

explicit construction of the polar factors that if X(s) is a continuous path connecting 
the identity to X, and ifX(s)=B(s)R(s)is the polar decomposition for each value of 
s, then B(s) and R(s) are continuous paths connecting B and R to the identity. □ 
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We will use this to decompose any proper orthochronous Lorentz transformation 
L into a boost B and a proper spatial rotation R. Let 

I m) 

be a Lorentz transformation. The defining relation dZTl . as well as the relation 
L^L^g*^^ = g"-^ which follows from it, are equivalent respectively to 

d^ = y^-^ , yb = M • a , M • = I3 + b b"^ , (40a) 
b^ = - 1 , ya = • b , • M = I3 + a a"^ . (40b) 

The polar decomposition of the matrix L G 0(1 , 3)^ in ( l39t is given by 

L = B • R (41) 

with 

B=(;^ 13+%)' ^=(5 M-^)- ^'^^ 



Tho check this, first verify that L is indeed the product B • R using the relations HOi . 
Next we note that B is symmetric and that its eigenvalues (EV) are all positive: 



EV(B) = (y + Vy^-1,y-Vy2- 1,1,1) > 0. (43) 
Finally one checks that R is a spatial rotation, i.e. 

D:=M-^^j^ G S0(3). (44) 
1 +y 

Indeed, D • = I3 is easily verified using the relations (I40t and det(D) = 1 follows 
from det(L) = det(B) = 1. Hence we have found the polar decomposition of L G 

0^(1, 3). 

We can now characterize the factors B (boost) and R (rotation) of L in terms of the 
parameters y , a, b, M in d39b . We start with R: Using the first and second equation in 
(I40al) one readily shows that 

D • a = b . (45) 

Hence the plane of rotation for D is span{a, b} C M^. The rotation angle 9 obeys 



a-b 



cos 9 = — — - , (46a) 



where we used = b^ = y^ — 1 (first equations in (l40t ). On the other hand, it 
evidently also obeys the general equation 1 + 2 cos 9 = trace (D), i.e. 

a-b 

1 + 2 cos 9 = trace(M] - . (46b) 

1 +y 



Note that the analogous factorization <32t of a homogeneous Gahlei transformation into boost and 
rotation is not given by polar decomposition, but rather by a decomposition into a lower triangular 
matrix with unit diagonal (the boost) and an orthogonal matrix. This is a special case of what is 
generally known as Iwasawa decomposition. 
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Elimination of a • b via (I46at gives 



trace (M) - 1 

cos = . (46c) 

1 +y 



Next we set p := b/y, (3 := ||(3||, and p := p/(3; then 



y = y(|3) := 1/0 -132, b=yP, a = yDT.|3. (47) 

Writing B in terms of (3 explicitly shows that it is a boost with parameter (3 = v/c. 

The general Lorentz transformation ( l39l . instead of being considered as function 
of y , a, b, M obeying (l40b . can now be considered as function of (3 and D, 

1-(|3,D) =[ \ ^"^^^ 1 I ;^ 1 ' (48) 

' - l3 + (y-1)(3®|3T ' ''^ ^ ' 




=:B(|3) 

where y is now understood as function of (3 as in (l47b . The only restrictions on the 
parameters being that D G SO (3) and (3 G 5i C M^, where Si denotes the ball of unit 
radius centered at the origin (cf. (l69l ). The decomposition (l48l should be regarded as 
the analog of (I32t . 

It is easy to check directly that the boost are indeed equivariant with respect to 
rotations: 

R(D)-B(|3)-R(D-M =B(D-|3). (49) 

The polar decomposition is unique once the order of rotations and boosts are fixed. In 
W\\ we had put the rotations to the right, i.e. one first rotates and then boosts (we think 
actively). Had we chosen the opposite order the rotation parameter would still be D 
but the boost parameter would change to • |3. This follows immediately from P9l . 

4.3 The Lie algebras of the Lorentz and Galilei groups 

The commutation relations of the Lorentz Lie-algebra follow from the general formula 
(I195t . where we have to set e = 1 . Here we shall rename the generators Mah, where 
a, b G {0,1,2,3}, in the way explained below (indices l,j,k are in {1,2,3}). For 
direct comparison with (fT95l we also give their expression in terms of the defining 
representation (i.e. as elements of End(M'^)). So let {ea}a=o- -3 and {r["^}a=o- -3 be dual 
bases of M"* andrja := Qabil^. where Qab '■= Qi^a, 6b) (cf. Sect. IA.5l l. Then: 



Ji 
Ki 

Pi 
E 



2^ij]MjTc = eijkej^Tiic, (50a) 

iMio = ^(ei®Tio-eo«)rii), (50b) 

Ti = Ci, (50c) 

c To = ceo . (50d) 



These generate active rotations, boosts, translations in space, and translations in time 
respectively. The reason for the factors of 1 /c in (I50bt and c in (I50dl) is as follows: We 
wish the to be the generators of boosts with velocity parameters (rather than |3^ = 
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vVc), i.e. exp(Mi^o|3^) = exp(K|V^). Similarly, we wish E to be the generator of time 
translation with parameter At (rather than = cAt), i.e. exp(ToAx'^) = exp(EAt). 
This puts the and E in quantitative analogy to the corresponding generators in the 
Galilei group and hence facilitates a direct comparison. 
The relations (11951) now amount to 



[Ji, Jj] 


= £xjic Jk 


(51a) 


[Ji.Kj] 


= £ijkKk 


(51b) 


[Ki.Kj] 


= -Eijklk/C^ 


(51c) 


[Ji,Pj] 


= £ijkPk 


(5 Id) 


[Ji,E] 


= 


(51e) 


[Ki,Pj] 




(51f) 


[Ki,E] 


= Pi 


(51g) 


[Pi.Pj] 


= 


(51h) 


[Pi,E] 


= 0. 


(51i) 



Those involving J| on the left hand side just tell us that the other quantity in the bracket 
is either a spatial vector or scalar. According to (15 lab the Ji form a Lie subalgebra but, 
as e.g. (I51bt shows, not an ideal (cf. Sect. lA.Q.Tl . In contrast, (15 let shows that the 
Ki do not form a Lie subalgebra. The Jt, Kt span the Lie algebra of 0(1 ,3) and it is 
easy to prove from the first three relations above that it is simple (has no non-trivial 
ideals). Moreover, any of the ten generators appears on the right hand side of some 
relation (15 It . i.e. can be written as a commutator. This means that the Lie algebra of 
the inhomogeneous Lorentz group is perfect (i.e. generated by commutators). 

Another fact easily seen from (l51aB51ct is that the Lie algebra of the homogeneous 
Lorentz group is the 'complex double' (my terminology, see below) of the Lie algebra 
of SO (3). Let us explain this. Given a real Lie algebra L of dimension n, we consider 
the real vector space C ^ L of dimension 2rL. Here C is considered as two-dimensional 
real vector space and is clearly also taken over M. C L can be made into a real 2n- 
dimensional Lie algebra by defining [z-[®X-[ , Z2(8'X2] := ziZ2(X' [Xi , X2] andM-linear 
extension. This is easily checked to satisfy all axioms (11841) . The complex double of L 
is now defined to be the real Lie algebra C ® L. For sure, C (8) L has a natural complex 
structure, which allows to consider it as n-dimensional complex Lie algebra. In this 
case we'^ would call it L*^, the complexification of L. However, we are interested in 
Lie algebras of Lie groups, which a priori are always considered as real (cf. Sect. lA.9l l. 
regardless of the possible existence of a complex structure. Now let L be the Lie 
algebra of S0(3), i.e. L = spanjei , 62, 63} where [et, ej] = eijk^k- Consider C (8) L 
and set Rj := 1 (g) ej and cKj := i (g) e,, so that C L = span{Ri , R2, R3, Ki , K2, K3}. In 
this basis the Lie brackets are just given by (I51al51cl) . showing that the homogeneous 
Lorentz Lie-algebra is indeed the complex double of the Lie algebra of S0(3). 

The Lie algebra of the inhomogeneous Galilei group is formally obtained from 
dSTT i by taking the limit 1 /c^ — ) 0, to that the right hand sides of \5\c\ and (I51fl) 
are now replaced with zero. This causes big structural changes. For example, the 

" The terminology used here is non-standard. Often the distinction between C(8)L and L"" is not explicitly 
made, and even if it is, both are called 'the complexification' of L. 
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generators of boosts now generate an Abelian ideal in the homogeneous Galilei Lie- 
algebra (generated by R|, K^), implying that it is not even semisimple, whereas we 
just said that the homogeneous Lorentz Lie-algebra is simple. In the inhomogeneous 
Galilei Lie-algebra the and P^^ together generate an Abelian ideal. It is not perfect 
since the Kt and E do not occur on the right hand sides anymore. 

One might argue that it is physically incorrect to take E/c^ to zero in the limit 
c — > oo. Rather, E — > oo as c — > oo since E contains a contribution mc^ from the rest- 
energy of the system (ra denotes the rest mass, which we wish to keep at a finite value). 
Hence, for an isolated system, one should rather set E = mc^ + Eq and therefore have 
E/c^ — ) m in the limit as c — > c«. Then the right hand side of (15 let is still zero in this 
limit but the right hand side of (151ft becomes proportional m 6ij, where m is now read 
as a new element of the Lie algebra that commutes with all other elements, i.e. lies in 
the center (in any irreducible representation it is therefore written as ml where 1 is the 
unit operator). Also, due to ra being central, ( |51g| l is maintained with Eq replacing E. 

The 11 -dimensional Lie algebra so obtained is well known. It is a central exten- 
sion of the inhomogeneous Galilei Lie-algebra, out of a unique 1 -parameter family of 
inequivalent central extensions, labeled by the value of ra. As is well known, it is 
this extension (and the corresponding 1 1 -dimensional Lie group, sometimes called the 
Schrodinger group), which implement the Galilean symmetries in quantum mechan- 
ics by proper representations, whereas the inhomogeneous Galilei group only acts by 
ray-representations. Formally, the central element ra then gives rise to superselection 
rules. There are certain analogs of this on the classical level; see l24l . 

The formal process by which the (inhomogeneous and homogeneous) Galilei Lie- 
algebra emerges from the Lorentz Lie-algebra is a special case of what is called a 
contraction, which was introduced in |32 1 just in order to understand precisely the way 
in which the Galilei Lie-algebra and group can be understood as limiting case of the 
Lorentz Lie-algebra and group respectively. The general idea can be briefly described 
as follows: Consider a Lie algebra L with decomposition into two linear subspaces 
L = H © H', none of which we a priori assume to be a Lie subalgebra. Choose an 
adapted basis {Xi , • • • X^, Xp • • • X^,} such that the unprimed elements span H and the 
primed elements H'. The Lie brackets have the general form 



We now rescale the primed generators, leaving the unprimed ones untouched. 




Ca'b'Xc + C^/^/X^/ . 



(52b) 



(52a) 



(52c) 




X 




(53a) 
(53b) 



and write down (l52l in terms of the new basis: 



[Ya,Yb] 
[Ya,Y;,] 



[Yq') Yb' 





'b' 'c 



Yc + 



+ 



+ 



£ *^Qb'c'> 
pc' Y' 



(54b) 



(54c) 



(54a) 
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We wish to formally take the limit e — > 0. Clearly this cannot be done unless the terms 
oc e^^ in (I54at all vanish, i.e. unless C^^, = 0, which is equivalent to saying that 
H := span{Xi , • • • X^} must be a Lie-subalgebra of L. Assuming that this is the case, 
the limit can be taken and the following Lie algebra emerges: 

[Ya,Yb] = C^bYc, (55a) 
[Ya,Y;,] = C^.'b'Yc', (55b) 
[Y;,,Y^,] = 0. (55c) 

Thus we see that in the limit the subalgebra H survives whereas the linear space H' 
turns into an Abelian ideal. Hence the limit Lie algebra is a semi-direct sum of the 
original Lie subalgebra H with the Abelian ideal H'. It is called the contraction ofL 
over H, since H stays intact and the rest is contracted. On the level of Lie groups one 
might think of the contracted group (the group generated by H') as an infinitesimal 
neighborhood of the group one contracts over (the group generated by H) within the 
full Lie group (the group generated by L). 

This applies to the transition Lorentz — > Galilei as follows: In the homoge- 
neous case, we decompose the Lorentz Lie-algebra into the Lie subalgebra H = 
span{Ji , J2, J3} and the linear subspace H' = span{Ki , K2, K3}, and then contract it 
over H to obtain the homogeneous Galilei Lie-algebra. In the inhomogeneous case we 
set H = span{Ji, J2, J3, E}, which is indeed a Lie subalgebra as seen from dSTT i. and 
H' = span{KT , K2, K3, Pi , P2, P3}. Contracting over H then just results in making H' 
Abelian, i.e. annihilating the right hand sides of (15 let and (15 1ft . which just results in 
the inhomogeneous Galilei Lie-algebra. Its structure as semi-direct sum with H' as 
Abelian ideal is just the Lie-algebra analog of the semi-direct product structure (I36t . 



4.4 Composing boosts 

After this digression into Lie algebras we return to the level of groups. More specifi- 
cally, we are now interested in the composition of two boosts, B((3i) and B((32). The 
matrix product can be easily computed using the explicit form of B((3) as given in 
(l48l . We set := y((3t) for i = 1,2 and denote by piy and pi^ the components of 
pi parallel and perpendicular to the other velocity respectively. The angle between pi 

and p2 is denote by cp, i.e. Pi • P2 = cos cp. Then the matrix product has the general 
form ( I39t . where 

y = yiyi(l + Pi • P2) = yiyi(l + PiP2cos(p) , (56a) 
a = yiy2(P2+Pi||+y2^Pi±), (56b) 
b =yiy2(Pi + P2||+y7^P2±), (56c) 

M = l3 + (yi -l)Pi P| + (y2-l)p2»pj 

+ (PiyiP2y2 + (yi-l)(y2-l)Pi - Pi) Pi ®pI. (56d) 

The resulting boost and rotation parameters will be called p = pi * P2 and D = 
T[pi , P2] respectively. Hence we have: 

B(Pi)-B(p2) = B(pi*p2)-R(T[Pi,P2]). (57) 
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The operation * entails the law of how to compose velocities in SR. T[|3i , pa] is called 
the 'Thomas rotation'. Its existence (i.e. it being non trivial) means that pure boosts 
do not form a subgroup in the Lorentz group, in contrast to the Galilei group. 

The functional form of the * operation follows from d56l . since pi * (32 = b/y: 

Pi*p2 = ; — ^ — ^ • (58) 

1 +Pt P2 

Comparing (I56bl) with (I56ct shows a/y = (32 * (3 1 . Equation (l45l then shows 

(3i*(32 = T[|3i,|32] • ((32*(3i), (59) 

which in turn implies (we write T^^ [— , — ] for the inverse matrix (T[— , — ] ) 

T[(3i,(32] =T-i[(32,(3i]. (60) 

Let now D G S0(3] be any rotation; then dSSl shows that * obeys 

(D- (3i)*(D- (32) =D- ((3i*(32), (61) 

which combined with d59l also shows that 

T[D • Pi , D • (32] = D • T[(3i , (SJ • . (62) 

The Thomas rotation takes place in the plane span{a, b} = span{Pi , P2}. The 
cosine of the angle of rotation, 9, follows from (146c I) and (I56dt : 

n 1 (TI - 1)(T2- 1) .2 ,^0 ^ 

COS 6 = 1 ; sm CD , (63a) 

y + 1 

where we used (I56al) to eliminate a term oc cos cp. It shows that T[(3i , (32] = I3 iff (3i 
and p2 are either parallel (cp = 0) or anti-parallel (cp = n). We can now again make 
use of (I56at to eliminate y in favor of y 1 , y2, and cos cp, so as to make cos 9 a function 
of the moduli Pi , P2 of the velocities and the angle cp between them: 

Q 1 (yi - l)(y2- l)sin^(p 
cos 9 = 1 . (63b) 



1 +T1T2 + Y(yf- i)(y2 1) ^^^v 

Alternatively we can use ( I56at to express cos 9 as function of the tree moduli pi , P2, 
and p = ||(3i * (32||, which assumes a nice symmetric form:^^ 

cos 9 = — — — - 1 . (63c) 

(1 +t)(1 +Ti)(l +72) 

Figure^illustrates the laws d58t and d59l of the Thomas rotation for a special case 
in which the two velocities are perpendicular. In such cases 9 ranges between and 
7t/2, as can be immediately deduced from (I63bt . The sense of the Thomas rotation in 

This derivation, albeit straigiitforward, is a little tedious. A more elegant derivation, using Clifford 
algebra, is given in 1491 . 
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Pi 

Figure 1: Addition of perpendicular velocities (3i and Pi of modulus |3i = (32 = 0.78 
so that y^^ = yj^ = 5/8. In this case d58l gives pi ★ pi = pi + |(32 and likewise 
p2* P 1 = P2+ 1 P 1 . For comparison, the dashed arrow corresponds to the 'classically' 
composed velocities (vector addition). According to ( l59t . the rotation T[Pi , P2] turns 
p2 * pi into Pi ★ P2, as indicated by the curved arrow. 



the pi p2-plane is negative (we orient this plane in the usual way, such that pi x p2 
defines the direction of the normal). 

Generally 9 ranges between and n. More precisely, take fixed moduli P 1 and 
p2 and consider cos as function of cp as given by (I63bl) . For cp = and cp = 7t 
this function has obvious maxima (where cos 9 = 1) and hence must have a minimum 
inbetween, which corresponds to a maximal value of 9. Using (I63bt we compute that 
this maximum of 9 occurs at a value cpra which obeys 



(Tl-DlTz-') 

(the negative sign shows that cpm. > 7t/2) and that the maximal value Q^n obeys 

cosem=1-2- — — — =-cos2cpra. (65) 

(Yi + 1)(Y2 + 1) 

Hence we see that becomes larger than n/2 for sufficiently large values of yi and 
y2. For example, if pi = p2 = P, i-C- yi = y2 = T. the value of p above which 
Qrtv exceeds 7t/2 is given by 2^/4/(2 V2 + 2) « 0.985. Equation ^ also shows that 
Qfn approaches its maximal value, tt, only if yi and tend to infinity. In general, 
(I63bt shows that in that limit cos 9 approaches cos cp, which means that 9 approaches 
27r — cp, since the Thomas rotation is in the negative sense relative to the orientation of 
the pi-p2 plane. 

Finally, using (l49l and dSTl . we can now write down the general composition law 
for Lorentz transformations: 

L(Pi,Di) • L(P2,D2) =L(Pi *Di • P2, T[Pi,Di • PJ -Di -02) . (66) 
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Moreover, noting that (B(P]) ^=B(— (3), equations (I41l49t also show that 



(L(|3,D]) ' =L(-D-i.|3,D-^). (67) 

Note that (l67l and d34b are just the same analytic operations on the parameter spaces. 
The multiplication law d66t now replaces the semi-direct product structure d33l of the 
Galilei group, into which it turns in the limit c — ) oo. Indeed, writing (3 = v/c, the 
operation * between the v's approaches + and the Thomas rotation T[— , — ] becomes 
the identity, as one e.g. sees from (I63bt for y i , 72 — > 1 . 

4.5 The algebraic structure of velocity composition 

Let us say a little more about the algebraic structure behind d58l . First of all, * defines 
a map 

*■.B^y^B^^B^, (Pi , Pi) ^ Pi ★ pi , (68) 

where 

B^ :={P G M? \ ||P|| < 1} (69) 

is the open ball in 3-dimensional Euclidean space (here space of velocities/c). That 
its image lies indeed in B-\ C follows from (I56al) . which e.g. implies y < 2yiy2. 
Hence ★ makes Boo into a groupoid (see below). Moreover, for each (3 G Si , we have 

0*p = p*d=p, (70) 

so that is a unit with respect to Each element also has an inverse (left and right): 

P*(-P) = (-P)*P =0. (71) 

We already saw in d59b that the Thomas rotation obstructs commutativity of We 
now show that it also obstructs associativity. Consider the composition of three boosts 
B(pi ) • B(p2) • B(p3) and use associativity of matrix multiplication: 

B(Pi) • (B(P2) • B(P3)) = (B(pi) • B(p2)) • B(p3) . (72a) 

Iterated application of dSTl shows that the left hand side is equal to 

B (p 1 * ( P2 * P3)) • R(T[p 1 , p2 * P3] • T[P2, P3] ) , (72b) 

whereas the right hand side equals (also making use of d49l ). 

B((Pi*P2)*(T[Pi,p2]-p3)) •R(T[Pi*P2,T[pi,p2]-p3] •T[Pi,P2]). (72c) 

Expressions (I72bt and (I72ct are in polar decomposed form. Uniqueness then implies 
equality of the boost and rotation factors separately. For the boosts this implies 

Pi * (p2* p3) = (Pi * P2) ★ (T[Pi , P2] • (33) , (73) 

which shows how the Thomas rotation obstructs associativity. The general identity 
obtained from equating the rotational parts of (I72bt and (I72ct does not interest us here. 
Rather, we wish to consider the special case where pi = P3. Then the product (1721 
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is a symmetric and positive definite matrix^^, that is, it is a pure boost and therefore 
(trivially) polar decomposed. Hence the rotational part in (I72bt must be the identity. 
This gives 

T[|3i,|32] =T[|3i,|32*|3i] =T[|3i*|32,|32], (74) 
where the second equality follows from the first by simultaneously taking the inverse 
and exchanging pi and ^2 (which leaves T[(3i, (32] invariant according to d60l). 
Now consider the following equation in pi , (32, and (33: 

(3i * (32 = (33 . (75a) 

Can we (uniquely) solve it for (3i given (32 and P3, or for (32 given (3i and (33? Since 
each (3 has an inverse, associativity would immediately answer this in the affirmative. 
But associativity fails to hold. However, the answer is still affirmative: 

Proposition 3. The unique solutions of \75a\ for (3i and (32 are given by 

(3i = (33*(-T[(33,(32]-(32), (75b) 

(32 = (-(3i)*(33. (75c) 



Proof. M5c\ immediately follows from ^-multiplying (I75al) with — (3i from the left 
and using dTSl . taking into account that T[— (3i , (3i] = I3. The proof of (I75bl) is more 
difficult. One way that is not just 'guessing and verifying', but rather arrives at the 
solution in a more systematic fashion, is to go back to the group level and consider the 
corresponding equation 

L((3i,Di)-L(P2,D2) = L((33,D3), (76a) 

whose parameter form is 

(33 = (3i*Di-(32, (76b) 

D3 = T[(3i,Di • (32] -Di -02. (76c) 

The group structure now tells us that the unique solution for L( (3 1 , Dt ) is, using d67t . 

L(Pi,Di) = L(p3,D3) • L(-D2^ • ^2,02^ (77a) 

whose parameter form is 

(3i = (33*(-D3-D2^ -(32), (77b) 

Di = T[(33 , - D3 • D2 ^ • P2] • D3 • D2 ^ . (77c) 

Due to the group structure (1761 and dTTl are equivalent. In particular, dTTl is a conse- 
quence of (1761 . We now specialize to the case Di = I3, in which (I76bt just becomes 
(I75at . Equation (I76ct then becomes 

D3-D2^ = T[(3i,(32] 

= T[Pi * (32 , (32] using dllli 

= T[(33,(32] using (|76bll. (78) 

Inserting this into (I77bt gives (I75bt . □ 

For matrices it is generally true that if B is positive definite and A invertible, then A ■ B ■ A* is again 
positive definite. Note that here A* is the adjoint of A with respect to the Euclidean inner product. 
For then we could e.g. ★-multiply <75at with —pi from the right and get on the left hand side (pi ★ 

|32)*{-P2) = |3i ★ (132* {-Pi]) = P,. 
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Let us relate these findings to some algebraic terminology. A groupoid is a set 
S with some map c() : S x S — > S. Hence -k makes the open unit ball B-\ C 
into a groupoid. An associative groupoid is called a semigroup (so we don't have a 
semigroup) . A groupoid S is called a quasigroup if for any pair (a, b) G S x S 
there is a unique pair (x,y) G S x S such that ct)(x, a) = b and c()(a,y) = b. In 
our case we have just seen that the unique pair (x,y) associated to (a,b) = ((3i,|32) 
is X = (32 * (— T[(32, pi] • (3i) and y = (— |3i) -k (32. If a common unit element 
exists, as in dTOl . one calls it a quasigroup with unit or simply a loop. Note that in 
this case the existence of a unique inverse for each element follows. In some sense 
a loop is as close as you can get to the structure of a group if you drop associativity. 
This is the algebraic structure of velocity space in SR. Much original work on this has 
been done by A. Ungar, starting with |46|, where e.g. the precise way in which strict 
associativity fails (i.e. dTSb ) was first spelled out; see also his comprehensive treatise 
l47l and references therein. In a more recent book l48l the same author systematically 
develops the intimate relation to hyperbolic geometry. A brief history of the research 
on these generalized algebraic structures is given in f42'|. 

Let us briefly come back to the composition formulae (1751 . We interpret (3i, (32, 
and (33 as velocities of frames: (3i is the velocity of frame 2 with respect to (i.e. mea- 
sured in) frame 1. (32 is the velocity of frame 3 with respect to frame 2. Finally, (33 is 
the velocity of frame 3 with respect to frame 1. Then, using (I56al) . it is easy to derive 
the following expressions for the moduli of (33 and ^2- 

Pi = i^l±ljl^l|liiM, (79a) 

lfo-P,)^-lg3xg,)^ (79b) 
(l-fo-P,)^ 

(32 is the modulus of the relative velocity between frames 2 and 3 as function of the 
velocities of these frames with respect to a third one (here frame 1). It may either be 
interpreted as velocity of frame 3 with respect to frame 2, (as above) or as velocity of 
frame 2 with respect to frame 3 (reciprocity of frame velocities, see Sect.|3ll. Accord- 
ingly, the right hand side of (I79bl) is symmetric under the exchange (3i f-) (33. 



4.6 The geometric structure of velocity composition 

Even though the discussion of the geometry behind velocity composition belongs, 
strictly speaking, to the next, the geometry section, it is so intimately related to the 
discussion just given that it seems more appropriate to place the two right next to each 
other. 

More precisely, the composition law for velocities is intimately related with hyper- 
bolic geometry (i.e. geometry on spaces with constant negative curvature), as was first 
pointed out by Sommerfeld [43], Varicak |5llllll3, Robb and Borel im. More 
recently the subject was elaborated on by Ungar ll48l . The general reason is that the 
space of four-velocities 

He := {u G I g(u,u) = c^} C M"* , (80) 
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is a 3-dimensional hyperbola in (M'^.g), whose induced metric, he, is of constant 
negative curvature. Since Tic is spacelrke, we obtain the Riemannian metric he by 
restricting — g to the tangent bundle of Tic)- It is easy to write down he in terms of the 
coordinates p G ;Bi (cf. (l69t). For this write u = 07(1 , (3) and set dp = d|3|| + d(3^, 
where 

dP|| := |3(P • d|3)/|3^, (81a) 
dp^ := dp - dP|| . (81b) 

Then we have 

du = cy^(P-dP||, dP||+y"^dp^), (82) 
so that the Riemannian metric on the unit hyperbola is given by 

h := c^^he := -c-^ g ^v du^ du^ = dpf + dpi . (83) 

Introducing spherical angular coordinates in the usual fashion this can be written in 
various standard forms, depending on the choice of the radial coordinate: 

+,^i_(de2 + sin2ed(p2) (84a) 



(l-P^F ■ 1-P 
dR^ 



+ R^(de^ + sin2ed(p2) (84b) 

jdr^ + (de^ + sin^ 9 dcp^) } (84c) 
dp^ + sinh^p(de^ + sin^ed(p^) , (84d) 



1 +R2 



where 



R := ^=E^= with range [0,cxd), (85a) 

r := 1^ with range [0,1), (85b) 

p := tanh^^ p with range [0,oo). (85c) 

The forms (I84bl84dt correspond to three standard ways of writing a metric of constant 
sectional curvature —1, familiar e.g. from relativistic cosmology, where they appear 
as spatial part of the k = — 1 Friedmann-Robertson- Walker metric. From (I84dt we 
learn that the rapidity p (cf. (l26l ) is just the geodesic distance'^ on the hyperbola "He 
of four-velocities with respect to the rescaled metric h = c^^hc. This explains why 
the composition of velocities in the same direction is just ordinary addition if written 
in terms of rapidities; compare the remark following equation d26l . 

The relation between <84b> and the conformally flat form <84ct is given by R = 2t/{1 — r^] and 
T = R/(1 +yTTR2). 

In EicJ p appears as geodesic distance to the center of the chosen system of spherical polar coordi- 
nates. However, since Lor^ acts as a transitive group of isometries on Ti.^, <84ct is valid no matter 
what point on Tic is chosen for the center of coordinates. 
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The metric h, is also inherent in formula ( I79bt . which may also be read as endowing 
B-\ with a Riemannian metric when applied to two infinitesimally nearby velocities 
(3 = pi and |3 + dp = Pa. Then p2 gives us the square of their distance, ds^, for 
which we obtain 

ds^ = i£^liii4E=/dPl + y^dPi (86) 

which coincides with (I83t . Equation (I79bt is the starting point in Fock's discussion 
of the hyperbolic geometry of velocity space ([181 § 17). We also draw attention to a 
recent pedagogical discussion in |40| and the systematic treatment in Ch. 7 of |48 1. 

In this geometric setting the law i79al for the modulus of the composed velocities 
just turns into the law for the length of the third side of a geodesic triangle as function 
of the length of the two other sides and the angle between them. This is most easily 
read off from (I56al) if rewritten in terms of rapidities, i.e. yi = cosh pt and ptyi = 
sinh Pi. 

cosh P3 = cosh pi cosh p-\ + sinh pi sinh p2 cos (p . (87) 

This is just the well known 'cosine-law' for hyperbolic triangles, the connection of 
which with the law of composing velocities in SR was first pointed out by Sommer- 
feld 1431 and later, independently, by Borel flTl . Again we refer to ll48l for a modern 
and comprehensive treatment. 

A beautiful application of the hyperbolic geometry of velocity space dSOt concerns 
Thomas rotation |49|. Suppose a torque-free gyro is carried along the worldline z{r) 
of an observer. The hodograph is the curve z(t) on "He and zIt)-*- can be identified 
with the tangent plane to Tic at z(t). At each instant the gyro's angular-momentum 
vector lies in this tangent plane and along the worldline it is Fermi-Walker transported. 
We recall that given a vector field X along the worldline z, the Fermi-Walker derivative 
of X along z is defined by 

F^X:=(V^X||)|| + (VtX^)^, (88) 

where || and _L denote the g-orthogonal projections parallel and perpendicular to the 
worldline's tangent direction z. Applied to the gyro's angular momentum vector one 
sees that the law of Fermi-Walker transportation along z turns into the law of parallel 
propagation along the hodograph on Tic with respect to the Levi-Civita connection for 
the hyperbolic metric that Tic inherits from its embedding into Minkowski space. 
Applied to spatially periodic orbits the holonomy of their closed hodographs in the 
tangent bundle of Tic is then just Thomas' rotation. This neat geometric idea goes back 
to Borel [ 1 1 1, who sketched it almost 15 years before Thomas' paper [44 1 appeared. 

5 Geometric structures in Minkowski space 
5.1 Preliminaries 

Let us generally consider n dimensional Minkowski space M^, that is, the affine space 
over an n-dimensional, real vector space V with a non-degenerate bilinear form g of 

Generally, the Levi-Civita covariant derivative of a submanifold is obtained from the (covariant) deriva- 
tive of the ambient manifold by restricting it to tangent vectors and subsequently projecting the result 
tangentially to the submanifold. 
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signature (l,n — 1) (compare Sec. lA.7l and Sec. lA.2l respectively'). We introduce the 
following notations: 

v-w:=g(v,w) and ||v||g := \/|g(v,v]| . (89) 

We shall also simply write for v • v. A vector v G V is called timelike, lightlike, or 
spacelike according to being > 0, = 0, or < respectively. Non-spacelike vectors 
are also called causal and their set, C C V, is called the causal-doublecone. Its interior, 
C, is called the chronological-doublecone and its boundary, £, the light-doublecone: 

C : = (v e V I > 0} , (90a) 
C : = {v G V I > 0} , (90b) 
£:= {vG V|v^ = 0}. (90c) 

A linear subspace V C V is called timelike, lightlike, or spacelike according 
to g I y , being indefinite, negative semi-definite but not negative definite, or negative 
definite respectively. Instead of the usual Cauchy-Schwarz-inequality we have 

v^w^ < (v • w)^ for span{v,w} timelike , (91a) 
v^w^ = (v • w)^ for span{v,w} lightlike , (91b) 
v^w^ > (v • w)^ for span{v,w} spacelike . (91c) 

Given a set W C V (not necessarily a subspace'*^), its g-orthogonal complement 
is the subspace 

W-^ := {v G V I V • w = 0, Vw G W} . (92) 

If V G V is lightlike then v G v-*-. In fact, v-*- is the unique lightlike hyperplane 
containing v. On the other hand, if v is timelike/spacelike V"*- is spacelike/timelike and 

V v-*-. 

Given any subset W C V, we can attach it to a point p in M"^: 

Wp := p + W := {p + w I w G W} . (93) 

In particular, the causal-, chronological-, and light-doublecones at p G are given 
by: 





= P + C, 


(94a) 




= p + c, 


(94b) 


^p 


= p + £. 


(94c) 



If W is a subspace of V then Wp is an affine subspace of over W. If W is time- 
, light-, or spacelike then Wp is also called time-, light-, or spacelike. Of particular 
interest are the hyperplanes Vp which are timelike, lightlike, or spacelike according to 
V being spacelike, lightlike, or timelike respectively. 

Two points p, q G M"^ are said to be timelike-, lightlike-, or spacelike separated if 
the line joining them (equivalently: the vector p — q) is timelike, lightlike, or spacelike 

By a 'subspace' of a vector space we always understand a sub vector-space. 
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respectively. Non-spacelike separated points are also called causally separated and the 
line though them is called a causal line. 

It is easy to show that the relation v~w<^v-w>0 defines an equivalence 
relation on the set of timelike vectors. (Only transitivity is non-trivial, i.e. if u • v > 
and V • w > then u • w > 0. To show this, decompose u and w into their components 
parallel and perpendicular to v.). Each of the two equivalence classes is a cone in V, 
that is, closed under addition and multiplication with positive numbers. Vectors in the 
same class are said to have the same time orientation. In the same fashion the relation 
v~w<^v-w>0 defines an equivalence relation on the set of causal vectors, 
with both equivalence classes being again cones. The existence of these equivalence 
relations is expressed by saying that M"^ is time orientable. Picking one of the two 
possible time orientations is then equivalent to specifying a single timelike reference 
vector, V*, whose equivalence class of directions may be called the. future. This being 
done we can speak of the future (or forward) (+) and past (or backward) (— ) cones: 

: = {v G C I V • V, ^ 0} , (95a) 
C± : = {v G C I V • V, ^ 0} , (95b) 
£± : = (v G £ I vv, ^ 0} . (95c) 

Note that = \J and T] = 0. Usually £+ is called the future and 
the past lightcone. Mathematically speaking this is an abuse of language since, 
in contrast to and C^, they are not cones: They are each invariant (as sets) under 
multiplication with positive real numbers, but adding to vectors in £^ will result in a 
vector in unless the vectors were parallel. 

As before, these cones can be attached to the points in M"^. We write in a straight- 
forward manner: 

C±:=p + C±, (96a) 
C±:=p + C±, (96b) 
£^ : = p + £± . (96c) 

The Cauchy-Schwarz inequalities W\\ result in various cases for generalized tri- 
angle inequalities. Clearly, for spacelike vectors, one just has the ordinary triangle 
inequality. But for causal or timelike vectors one has to distinguish the cases accord- 
ing to the relative time orientations. For example, for timelike vectors of equal time 
orientation, one obtains the reversed triangle inequality: 

||V+W||g > ||v||g+ ||w||g, (97) 

with equality iff v and w are parallel. It expresses the geometry behind the 'twin 
paradox'. 

Before we turn to the next section, we remark that any bijective map cj) : M"^ — > 
M"^ that satisfies d(p, q) = d(c|)(p), c()(q)), where d(p, q) := ||p — q ||g, is necessarily 
affine linear. This follows immediately from the corresponding statement for vector 
spaces, as given in Propositionl20l The results in the following section should be 
considered as strengthenings of this statement. 
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5.2 Causality relations and the Lorentz group 

The family of cones {C^ \ q G M"^} defines a partial order relation, denoted by > (cf. 
Sec. IA.lt . on spacetime as follows: p > q iff p G C^, i.e. iff p — q is causal and future 
pointing. Similarly, the family {C^ \ q G M"^} defines a strict partial order, denoted 
by > (cf. Sec. IA.lt : p > q iff p G C^, i.e. if p — q is timelike and future pointing. 
There is a third relation, called >, defined as follows: p > q iff p G £q , i.e. p is on the 
future lightcone at q. It is not a partial order due to the lack of transitivity, which, in 
turn, is due to the lack of the lightcone being a cone (in the proper mathematical sense 
explained above). Replacing the future (+) with the past (— ) cones gives the relations 
<, <, and <. 

It is obvious that the action of ILor^ (spatial reflections are permitted) on 
maps each of the six families of cones d96l into itself and therefore leave each of the 
six relations invariant. For example: Let p > q and f G ILor^, then (p — q)^ > and 
p — q future pointing, but also (f(p) — f(q))^ > and f (p) — f(q) future pointing, 
hence f(p) > f (q). Another set of 'obvious' transformations of leaving these 
relations invariant is given by all dilations: 

d(A,^):M^^M^, p^d(A,^)(p):=A(p-m) + m, (98) 

where A G M+ is the constant dilation-factor and m G M"^ the center. This follows 

from (dA,m(p)-dA,m(c|))^ = A^(p-q)^> (dA,m(p) - dA,m(q)) -v* = A(p-q)-v*, 
and the positivity of A. Since translations are already contained in ILor^, the group 
generated by ILor^ and all dx.m is the same as the group generated by ILor^ and all 
d-A.m for fixed m. 

A seemingly difficult question is this: What are the most general transformations of 
M"^ that preserve those relations? Here we understand 'transformation' synonymously 
with 'bijective map', so that each transformation f has in inverse 'Preserving 
the relation' is taken to mean that f and f^^ preserve the relation. Then the some- 
what surprising answer to the question just posed is that, in three or more spacetime 
dimensions, there are no other such transformations besides those already listed: 

Theorem 4. Let >- stand for any of the relations >,>,> and let f be a bijection of 
Wl^ with n > 3, such that p q implies f (p) >- f (q) and f^^ (p) >- f^^ (q). Then f 
is the composition of an Lorentz transformation in ILor^ with a dilation. 

Proof. These results were proven by A.D. Alexandrov and independently by B.C. Zee- 
man. A good review of Alexandrov's results is LI |; Zeeman's paper is [53 1. The re- 
striction to n > 3 is indeed necessary, as for n = 2 the following possibility exists: 
Identify with and the bilinear form g(z, z) = — y^, where z = (x,y]. Set 
u := X — y and v := x + y and define f : — > by f (u, v] := (h,(u), h,(v)), where 
h, : M — > M is any smooth function with h' > 0. This defines an orientation preserving 
diffeomorphism of which transforms the set of lines u = const, and v = const, 
respectively into each other. Hence it preserves the families of cones (I96at . Since 
these transformations need not be affine Unear they are not generated by dilations and 
Lorentz transformations. □ 

These results may appear surprising since without a continuity requirement one 
might expect all sorts of wild behavior to allow for more possibilities. However, a 
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little closer inspection reveals a fairly obvious reason for why continuity is implied 
here. Consider the case in which a transformation f preserves the families {C+ | q G 
M"^} and {C^ \ q G M^}. The open diamond-shaped sets (usually just called 'open 
diamonds'), 

U(p,q):=(C+nCq)U(C+nCp, (99) 

are obviously open in the standard topology of M"^ (which is that of M^). Note that at 
least one of the intersections in d99l is always empty. Conversely, is is also easy to see 
that each open set of M"^ contains an open diamond. Hence the topology that is defined 
by taking the U(p, q) as subbase (the basis being given by their finite intersections) is 
equivalent to the standard topology of M". But, by hypothesis, f and f^^ preserves 
the cones Cq and therefore open sets, so that f must, in fact, be a homeomorphism. 

There is no such obvious continuity input if one makes the strictly weaker require- 
ment that instead of the cones ( l96t one only preserves the doublecones ( l94t . Does that 
allow for more transformations, except for the obvious time reflection? The answer is 
again in the negative. The following result was shown by Alexandrov (see his review 
Q) and later, in a different fashion, by Borchers and Hegerfeld fTOl : 

Theorem 5. Let ~ denote any of the relations: p ~ q jj^ (p — q)^ > 0, p ~ q 
(p — q )^ > 0, or p ~ q zj^ (p — q )^ = 0. Let i be a bijection o/M"^ with n > 3, such 
that p ~ q implies f (p) ~ f (q) and f^^ (p] ~ f^^ (q). Then f is the composition of an 
Lorentz transformation in ILor with a dilation. 

All this shows that, up to dilations, Lorentz transformations can be characterized 
by the causal structure of Minkowski space. Let us focus on a particular subcase 
of Theorem|5j which says that any bijection f of M"^ with n > 3, which satisfies 
Hp — q||g = <4 ||"f(p) — f(c|)||g = must be the composition of a dilation and a 
transformation in ILor. This is sometimes referred to Alexandrov 's theorem. It is, to 
my knowledge, the closest analog in Minkowskian geometry to the famous theorem of 
Beckman and Quarles jH, which refers to Euclidean geometry and reads as foUows^^: 

Theorem 6 (Beckman and Quarles 1953). Let for n > 2 be endowed with the 
standard Euclidean inner product {■ \ ■). The associated norm is given by \\x\\ := 
(x I x). Let 6 be any fixed positive real number and f : — > any map such that 
||x — y II = 6 ^ ||f ~ Hv)\\ = then f is a Euclidean motion, i.e. f G x: 0(ti]. 

Note that there are three obvious points which let the result of Beckman and Quar- 
les in Euclidean space appear somewhat stronger than the theorem of Alexandrov in 
Minkowski space: 

1. The conclusion of Theorem|6l holds for any 8 G M+, whereas Alexandrov's 
theorem singles out lightlike distances. 

" In fact, Beckman and Quarles proved the conclusion of Theorem|6|under slightly weaker hypotheses: 
They allowed the map f to be 'many- valued', that is, to be a map f : — > 5", where is the 
set of non-empty subsets of R'^, such that ||x — y || = 6 ||x' — y '|| =6 for any x' G f{x) and 
any y' £ f(y). However, given the statement of Theorem|6| it is immediate that such 'many-valued 
maps' must necessarily be single-valued. To see this, assume that x* £ R'^ has the two image points 
Tji ,1)2 and define ht : R'^ — > R" for i = 1,2 such that h] (x) = hifx) G f{x) for all x 7^ x« 
and h.i{x,) — yt. Then, according to Theorem|6| hi must both be Euclidean motions. Since they are 
continuous and coincide for all x 7^ x, , they must also coincide at x, . 
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2. In Theorem|6l n = 2 is not excluded. 



3. In Theorem|6l f is not required to be a bijection, so that we did not assume the 
existence of an inverse map f . CoiTcspondingly, there is no assumption that 
f also preserves the distance 8. 

5.3 Einstein synchronization 

We start by characterizing those cases in which a strict inverted Cauchy-Schwarz in- 
equaUty holds: 

Lemma 7. Let V be of dimension n > 2 and v ^ \ he some non-zero vector. The 
strict inverted Cauchy-Schwarz inequality, 

v^w^ < (v • w)^ , (100) 

holds for allw£V linearly independent ofv iffv is timelike. 

Proof. Obviously v cannot be spacelike, for then we would violate (llOOt with any 
spacelike w. If v is lightlike then w violates dlOOt iff it is in the set V"*- — span{v}, 
which is non-empty iff n > 2. Hence v cannot be lightlike if n > 2. If v is timelike 
we decompose w = av + w' with w' G V"*- so that w'^ < 0, with equality iff v and w 
are linearly dependent. Hence 

(v • w]^-vV = -v^w'^ > 0, (101) 

with equality iff v and w are linearly dependent. □ 

The next Lemma deals with the intersection of a causal line with a light cone, a situa- 
tion depicted in Fig.|2l 

Lemma 8. Let £p be the light-doublecone with vertex p and £ := {r + Av | r G M} Z^e 
a non-spacelike line, i.e. v^ > 0, through r Cp. Ifv is timelike I D Cp consists of 
two points. Ifv is lightlike this intersection consists of one point ifp—r^v-^ and is 
empty if'p — r G V"*-. Note that the latter two statements are independent of the choice 
ofr G I — as they must be — , i.e. are invariant under r h-> t' := r + o^, where a G K. 

Proof. We have r + Av G £p iff 

(r + Av - p)^ = <^ aV + 2Av • (r - p] + (r - p)^ = . (102) 
For V timelike we have v^ > and (11021) has two solutions 

Ai,2 = ^|-v-(r-p)±^(v-(r-p))^-v2(r-p)2| . (103) 

Indeed, since r £p, the vectors v and r — p cannot be linearly dependent so that 
LemmaQimplies the positivity of the expression under the square root. If v is lightlike 
(I102t becomes a linear equation which is has one solution if v • (r — p) 7^ and no 
solution if v • (t — p) = [note that (t — p)^ / since q £p by hypothesis]. □ 
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Figure 2: A timelike line £ = {r+ Av | A G M} intersects the light-cone with vertex p 
£ in two points: q+, its intersection with the future light-cone and q_, its intersection 
with past the light cone, q is a point inbetween q+ and q_. 

Proposition 9. Let I and C-p as in Lemma^with v timelike. Let q+ and q^be the two 
intersection points of I with C-p and q G £ a point between them. Then 

l|q -p||g = l|q+- q||g l|q - q-llg- (i04) 

Moreover, ||q+ — q||g = ||q — q_||g ijf^ — q is perpendicular to v. 

Proof. The vectors (q + -p) = (q-p) + (q+-q) and (q_-p) = (q-p) + (q_-q) 
are lightlike, which gives (note that q — p is spacelike): 

I|q-Pllg = -(q-p)^ = (q+-q)^ + 2(q-p)-(q+-q), (105a) 
I|q-Pllg = -(q-p)^ = (q--q)^ + 2(q-p)-(q_-q). (105b) 

Since q+ — q and q — q_ are parallel we have q_|_ — q = A(q — q_) with A G M+ so 
that (q+-q)^ = A||q+ - q||g||q - q-||g andA(q - q )^ = ||q+- q||g||q - q_||g. 
Now, multiplying (I105bt with A and adding this to (I105at immediately yields 

(1 +A)||q-p||^ = (1 +A) ||q+-q||g||q-q_||g. (106) 

Since 1 + A / this implies (I104t . Finally, since q + — q and q_ — q are antiparallel, 
||q+ — q||g = ||q_ — q||g iff (q+ — q) = — (q- — q). Equations (11051) now show that 
this is the case iff (q — p) • [q± — q) =0, i.e. iff (q — p) • v = 0. Hence we have 
shown 

||q+~ q||g = ||q - q_||g ^ (q-p)-v = 0. (107) 

In other words, q is the midpoint of the segment q+q_ iff the line through p and q is 
perpendicular (wrt. g) to I. □ 
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The somewhat surprising feature of the first statement of this proposition is that 
(I104t holds for any point of the segment q+q_, not just the midpoint, as it would have 
to be the case for the corresponding statement in Euclidean geometry. 

The second statement of Proposition|9l gives a convenient geometric characteriza- 
tion of Einstein-simultaneity. Recall that an event q on a timelike line I (representing 
an inertial observer) is defined to be Einstein-simultaneous with an event p in space- 
time iff q bisects the segment q_|_q_ between the intersection points q+, q_ of £ with 
the double-lightcone at p. Hence Proposition|9limplies 

Corollary 10. Einstein simultaneity with respect to a timelike line I is an equivalence 
relation on spacetime, the equivalence classes of which are the spacelike hyperplanes 
orthogonal (wrt. g) to L 

The first statement simply follows from the fact that the family of parallel hyper- 
planes orthogonal to I form a partition (cf. Sect. lA. il l of spacetime. 

From now on we shall use the terms 'timelike line' and 'inertial observer' synony- 
mously. Note that Einstein simultaneity is only defined relative to an inertial observer 
Given two inertial observers, 

£ = {r + Av I A G M} first observer , (108a) 

I' = [r' + A'v' I A' G R} second observer , (108b) 

we call the corresponding Einstein-simultaneity relations l-simultaneity and V- 
simultaneity . Obviously they coincide iff I and V are parallel (v and v' are linearly 
dependent). In this case q' G £' is ^-simultaneous to q G £ iff q G £ is £ '-simultaneous 
to q' G If £ and £'are not parallel (skew or intersecting in one point) it is generally 
not true that if q' G is ^-simultaneous to q G £ then q G £ is also £ '-simultaneous to 
q ' G V . In fact, we have 

Proposition 11. Let I and V two non-parallel timelike likes. There exists a unique 
pair (q,q') G I x V so that q' is i- simultaneous to q and q is V simultaneous to q'. 

Proof. We parameterize I and £' as in (I108t . The two conditions for q' being l- 
simultaneous to q and q being £ '-simultaneous to q' are (q — q') - v = = (q — q') - v'. 
Writing q = r + Av and q' = r' + A'v' this takes the form of the following matrix 
equation for the two unknowns A and A': 

(::- - i^) - (,';:^ ::-) ■ 

This has a unique solution pair (A, A'), since for linearly independent timelike vectors 
V and v' LemmaQ implies (v • v')^ — v^v'^ > 0. Note that if £ and £' intersect 
q = q ' = intersection point. □ 

Clearly, Einstein-simultaneity is conventional and physics proper should not de- 
pend on it. For example, the fringe-shift in the Michelson-Morley experiment is in- 
dependent of how we choose to synchronize clocks. In fact, it does not even make 
use of any clock. So what is the general definition of a 'simultaneity structure'? It 
seems obvious that it should be a relation on spacetime that is at least symmetric (each 
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event should be simultaneous to itself). Going from one-way simultaneity to the mu- 
tual synchronization of two clocks, one might like to also require reflexivity (if p is 
simultaneous to q then q is simultaneous to p), though this is not strictly required in 
order to one-way synchronize each clock in a set of clocks with one preferred 'master 
clock' , which is sufficient for many applications. 

Moreover, if we like to speak of the mutual simultaneity of sets of more than two 
events we need an equivalence relation on spacetime. The equivalence relation should 
be such that each inertial observer intersect each equivalence class precisely once. Let 
us call such a simultaneity structure 'admissible'. Clearly there are zillions of such 
structures: just partition spacetime into any set of appropriate^^ spacelike hypersur- 
faces (there are more possibilities at this point, like families of forward or backward 
Ughtcones). An absolute admissible simultaneity structure would be one which is in- 
variant (cf. Sect. lA. il l under the automorphism group of spacetime. We have 

Proposition 12. There exits precisely one admissible simultaneity structure which is 
invariant under the inhomogeneous proper orthochronous Galilei group and none that 
is invariant under the inhomogeneous proper orthochronous Lorentz group. 

Proof. See (El- □ 

There is a group-theoretic reason that highlights this existential difference: 

Proposition 13. Let G be a group with transitive action on a set S. Let Stab(p] C G 
be the stabilizer subgroup for p S S f due to transitivity all stabilizer subgroups are 
conjugate). Then S admits a G-invariant equivalence relation R C S X S /^Stab(p) 
is not maximal, that is, jj^Stab(p) is properly contained in a proper subgroup WofG: 
Stab(p) C H C G. 

Proof. See Theorem 1.12 in ll33t. □ 

Regarding the action of the inhomogeneous Galilei and Lorentz groups on space- 
time their stabilizers are the corresponding homogeneous groups. As already discussed 
at the end of Sect. 14.11 the homogeneous Lorentz group is maximal in the inhomoge- 
neous one, whereas the homogeneous Galilei group is not maximal in the inhomo- 
geneous one. This, according to Proposition^J is the group theoretic origin of the 
absence of any invariant simultaneity structure in the Lorentzian case. 

5.4 The lattice structure of causally and chronologically complete sets 

Here we wish to briefly discuss another important structure associated with causality 
relations in Minkowski space, which plays a fundamental role in modern Quantum 
Field Theory (see e.g. 128J). Let Si and S2 be subsets of M"^. We say that Si and S2 
are causally disjoint or spacelike separated iff pi — P2 is spacelike, i.e. (pi — pi)^ < 0, 
for any pi G Si and p2 G S2. Note that because a point is not spacelike separated 
from itself, causally disjoint sets are necessarily disjoint in the ordinary set-theoretic 
sense — the converse being of course not true. 

For example, the hypersurfaces should not be asymptotically hyperboloidal, for then a constantly ac- 
celerated observer would not intersect all of them. 
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For any subset S C M"^ we denote by S ' the largest subset of M"^ which is causally 
disjoint to S. The set S' is called the causal complement of S. The procedure of taking 
the causal complement can be iterated and we set S" := [S')' etc. S" is called the 
causal completion of S. It also follows straight from the definition that Si C S2 
implies S^' 3 S2 and also S" 5 S. If S" = S we call S causally complete. We 
note that the causal complement S ' of any given S is automatically causally complete. 
Indeed, from S" 5 S we obtain (S')" C S', but the first inclusion applied to S' 
instead of S leads to [S')" 5 S', showing (S')" = S'. Note also that for any subset S 
its causal completion, S", is the smallest causally complete subset containing S, for if 
S C K C S" with K" = K, we derive from the first inclusion by taking " that S" C K, 
so that the second inclusion yields K = S". Trivial examples of causally complete 
subsets of are the empty set, single points, and the total set M^. Others are the 
open diamond-shaped regions ( l99l as well as their closed counterparts: 



We now focus attention to the set Caus(M"^) of causally complete subsets of M^, 
including the empty set, 0, and the total set, M"^, which are mutually causally comple- 
mentary. It is partially ordered by ordinary set-theoretic inclusion (C) (cf. Sect. lA. il l 
and carries the 'dashing operation' (') of taking the causal complement. Moreover, 
on Caus(M"^) we can define the operations of 'meet' and 'join', denoted by A and 
V respectively, as follows: Let Si S Caus(M"^) where i = 1 ,2, then Si A S2 is the 
largest causally complete subset in the intersection Si n S2 and Si V S2 is the smallest 
causally complete set containing the union Si U S2. 

The operations of A and V can be characterized in terms of the ordinary set- 
theoretic intersection n together with the dashing-operation. To see this, consider two 
causally complete sets, St where i = 1 , 2, and note that the set of points that are space- 
like separated from Si and S2 are obviously given by Si n S2, but also by (Si U S2)', 
so that 



Here (llllat and (llllbt are equivalent since any Si G Caus(M"^) can be written as 
Si = P{, namely Pi = S[. If Si runs through all sets in Caus(M"^) so does Pi. Hence 
any equation that holds generally for all Si G Caus(M"^) remains valid if the Si are 
replaced by S{. 

Equation (llllbt immediately shows that Si n S2 is causally complete (since it is 
the ' of something). Taking the causal complement of (llllat we obtain the desired 
relation for Si V S2 := (Si U S2)". Together we have 



U(p, q) := (C+ n Cq ) U (C+ n Cp ) . 



(110) 



s^ns^ = (S1US2)', 
Si nS2 = (s; us^)'. 



(Ilia) 
(111b) 



Si AS2 = Si nS2, 
Si VS2 = (s^ ns^)'. 



(112a) 
(112b) 



From these we immediately derive 



(Si AS2)' = s;vs^, 

(Si VS2)' = S^ AS^. 



(113a) 
(113b) 
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All what we have said so far for the set Caus(M^] could be repeated verbatim 
for the set Chron(M"^) of chronologically complete subsets. We say that Si and S2 
are chronologically disjoint or non-timelike separated, iff Si Pi S2 = and (pi — 
Pl)^ < for any Pi £ Si and p2 G S2. S', the chronological complement of S, 
is now the largest subset of which is chronologically disjoint to S. The only 
difference between the causal and the chronological complement of S is that the latter 
now contains lightlike separated points outside S. A set S is chronologically complete 
iff S = S", where the dashing now denotes the operation of taking the chronological 
complement. Again, for any set S the set S ' is automatically chronologically complete 
and S" is the smallest chronologically complete subset containing S. Single points 
are chronologically complete subsets. All the formal properties regarding A, and V 
stated hitherto for Caus(M"^] are the same for Chron(M"^). 

One major difference between Caus(M"^) and Chron(M^) is that the types of 
diamond-shaped sets they contain are different. For example, the closed ones, (IllOt . 
are members of both. The open ones, (l99l . are contained in Caus(M"^) but not 
in Chron(M"^). Instead, Chron(M"^) contains the closed diamonds whose 'equa- 
tor' have been removed. An essential structural difference between Caus(M"^) and 
Chron(M"^) will be stated below, after we have introduced the notion of a lattice to 
which we now turn. 

To put all these formal properties into the right frame we recall the definition of a 
lattice. Let (L, <) be a partially ordered set and a, b any two elements in L. Synony- 
mously with a < b we also write b > a and say that a is smaller than b, b is bigger 
than a, or b majorizes a. We also write a < b if a < b and a / b. If, with respect to 
<, their greatest lower and least upper bound exist, they are denoted by a A b — called 
the 'meet of a and b' — and a V b — called the 'join of a and b' — respectively. A par- 
tially ordered set for which the greatest lower and least upper bound exist for any pair 
a, b of elements from L is called a lattice. 

We now list some of the most relevant additional structural elements lattices can 
have: A lattice is called complete if greatest lower and least upper bound exist for any 
subset K C L. If K = L they are called (the smallest element in the lattice) and 1 
(the biggest element in the lattice) respectively. An atom in a lattice is an element a 
which majorizes only 0, i.e. < a and if < b < a then b = or b = a. The 
lattice is called atomic if each of its elements different from majorizes an atom. An 
atomic lattice is called atomistic if every element is the join of the atoms it majorizes. 
An element c is said to cover a if a < c and if a < b < c either a = b or b = c. An 
atomic lattice is said to have the covering property if, for every element b and every 
atom a for which a A b = 0, the join a V b covers b. 

The subset {a, b, c} C L is called a distributive triple if 

aA(bVc) = (aAb)V(aAc) and (a, b, c) cyclically permuted , (114a) 
aV (b Ac) = (a Vb) A (aVc) and (a, b, c) cyclically permuted . (114b) 

Definition 5. A lattice is called distributive or Boolean if every triple {a, b, c} is dis- 
tributive. It is called modular if every triple {a, b, c} with a < b is distributive. 

^' By 'equator' we mean the (n — 2)-sphere in which the forward and backward light-cones in <110t 
intersect. In the two-dimensional drawings the 'equator' is represented by just two points marking the 
right and left corners of the diamond-shaped set. 
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It is straightforward to check from (I114t that modularity is equivalent to the fol- 
lowing single condition: 

modularity <4> aV(b Ac) =b A (aVc) for all a,b,c G L s.t. a < b. (115) 

If in a lattice with smallest element and greatest element 1 a map L — ) L, a h-> a', 
exist such that 

a" := [a')' = a, (116a) 
a<b^b'<a', (116b) 
aAa' = 0, aVa' = l, (116c) 

the lattice is called orthocomplemented. It follows that whenever the meet and join of 
a subset {ai | i G 1} (I is some index set) exist one has De Morgan's laws^^: 

(Aieiai)' = Vieia(, (117a) 
(Vteiai)' = Aiei<- (117b) 

For orthocomplemented lattices there is a still weaker version of distributivity than 
modularity, which turns out to be physically relevant in various contexts: 

Definition 6. An orthocomplemented lattice is called orthomodular if every triple 
{a, b, c} with a < b and c < b' is distributive. 

From (II 15t and using that bAc = Oforb<c' one sees that this is equivalent to 
the single condition (renaming c to c'): 

orthomod. a = bA(aVc') for all a, b, c G L s.t. a < b < c, (118a) 
^ a = bV(aAc') for all a, b, c G L s.t. a > b > c, (118b) 

where the second line follows from the first by taking its orthocomplement and re- 
naming a', b', c to a, b, c'. It turns out that these conditions can still be simplified by 
making them independent of c. In fact, (I118t are equivalent to 

orthomod. <4 a = bA(aVb') for all a, b G L s.t. a < b , (119a) 
^ a = bV(aAb') for all a, b G L s.t. a > b . (119b) 

It is obvious that (II 18t implies (I119t (set c = b). But the converse is also true. To see 
this, take e.g. (I119bt and choose any c < b. Then c' > b', a > b (by hypothesis), 
and a > a A c' (trivially), so that a > b V (a A c')- Hence a > b V (a A c') > 
b V (a A b') = a, which proves (I118bt . 

Complete orthomodular atomic lattices are automatically atomistic. Indeed, let b 
be the join of all atoms majorized by a 7^ 0. Assume a 7^ b so that necessarily b < a, 
then ( I119bt implies a A b' 7^ 0. Then there exists an atom c majorized by a A b'. 
This implies c < a and c < b', hence also c ^ b. But this is a contradiction, since b 
is by definition the join of all atoms majorized by a. 

From these laws it also appears that the definition <116ct is redundant, as each of its two statements 
follows from the other, due to ' = 1 . 
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Finally we mention the notion of compatibility or commutativity, which is a sym- 
metric, reflexive, but generally not transitive relation R on an orthomodular lattice (cf. 
Sec. IA.lt . We write al^b for (a, b) G R and define: 

a^b ^ a = (aAb) V(aAb'), (120a) 
^ b = (bAa) V(bAa'). (120b) 

The equivalence of these two lines, which shows that the relation of being compatible 
is indeed symmetric, can be demonstrated using orthomodularity as follows: Suppose 
(fT20al holds; then b A a' = b A (b' V a') A (b V a') = b A (b' V a'), where we 
used the orthocomplement of (I120al) to replace a' in the first expression and the trivial 
identity bA(bVa') = bin the second step. Now, applying ( I119bl ) to b > a A b we 
get b = (b A a) V [b A (b' V a')] = (b A a) V (b A a'), i.e. (fT20bl . The converse, 
(I120bll ^ Ill20all , is of course entirely analogous. 

From (I120t a few things are immediate: at]b is equivalent to at]b', at]b is implied 
by a < b or a < b', and the elements and 1 are compatible with all elements in 
the lattice. The center of a lattice is the set of elements which are compatible with 
all elements in the lattice. In fact, the center is a Boolean sublattice. If the center 
contains no other elements than and 1 the lattice is said to be irreducible. The other 
extreme is a Boolean lattice, which is identical to its own center. Indeed, if ( a, b , b ' ) is 
adistributivetriple, onehas a = aAl = aA(bVb') = (aAb)V(aAb') =^ lfT20all . 

After these digression into elementary notions of lattice theory we come back to 
our examples of the sets Caus(M^) Chron(M^). Our statements above amount to say- 
ing that they are complete, atomic, and orthocomplemented lattices. The partial order 
relation < is given by C and the extreme elements and 1 correspond to the empty 
set and the total set M"^, the points of which are the atoms. Neither the covering 
property nor modularity is shared by any of the two lattices, as can be checked by way 
of elementary counterexamples.^^ In particular, neither of them is Boolean. However, 
in 1 15] it was shown that Chron(M^) is orthomodular; see also fljil which deals with 
more general spacetimes. Note that by the argument given above this implies that 
Chron(M"^) is atomistic. In contrast, Caus(M") is definitely not orthomodular, as is 
e.g. seen by the counterexample given in Fig.lSj^"* It is also not difficult to prove that 
Chron(M"^) is irreducible.^^ 

It is well known that the lattices of propositions for classical systems are Boolean, 
whereas those for quantum systems are merely orthomodular. In classical physics the 

An immediate counterexample for the covering property is this: Take two timehke separated points 
(i.e. atoms) p and q. Then {p| A {q} = whereas {p} V {q} is given by the closed diamond <110L 
Note that this is true in Caus(M'^) and Chron(M'^). But, clearly, {p} V {q} does not cover either {p} 
or{q}. 

^* Regarding this point, there are some conflicting statements in the literature. The first edition of | 28J 
states orthomodularity of Chron(M'^) in Proposition 4. 1.3, which is removed in the second edition 
without further comment. The proof offered in the first edition uses jl 19at as definition of orthomod- 
ularity, writing Ki for a and K2 for b. The crucial step is the claim that any spacetime event in the 
set K2 A (K] V K2) lies in K2 and that any causal line through it must intersect either K] or K2. The 
last statement is, however, not correct since the join of two sets (here Ki and K2) is generally larger 
than the domain of dependence of their ordinary set-theoretic union; compare Fig.|3| : (Generally, the 
domain of dependence of a subset S of spacetime M is the largest subset D(S) C M such that any 
inextensible causal curve that intersects D(S) also intersects S.) 

In general spacetimes M, the failure of irreducibility of Chron(M) is directly related to the existence 
of closed timelike curves; see 1131 . 
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Figure 3: The two figures siiow that Caus(M^) is not orthomodular. The first tiling 
to note is that Caus(M^) contains open d99b as well as closed il 101) diamond sets. In 
the left picture we consider the join of a small closed diamond a with a large open 
diamond b'. (Closed sets are indicated by a solid boundary line.) Their edges are 
aligned along the lightlike line L Even though these regions are causally disjoint, their 
causal completion is much larger than their union and given by the open (for n > 2) 
enveloping diamond a V b' framed by the dashed line. (This also shows that the join 
of two regions can be larger than the domain of dependence of their union; compare 
footnote|24]) . Next we consider the situation depicted on the right side. The closed 
double-wedge region b contains the small closed diamond a. The causal complement 
b ' of b is the open diamond in the middle, a Vb ' is, according to the first picture, given 
by the large open diamond enclosed by the dashed line. The intersection of aVb' with 
b is strictly larger than a, the difference being the dark-shaded region in the left wedge 
of b below a. Hence a/bA(aVb'),in contradiction to ( I119at . 

elements of the lattice are measurable subsets of phase space, with < being ordinary 
set-theoretic inclusion C, and A and V being ordinary set-theoretic intersection n and 
union U respectively. The orthocomplement is the ordinary set-theoretic complement. 
In Quantum Mechanics the elements of the lattice are the closed subspaces of Hilbert 
space, with < being again ordinary inclusion, A ordinary intersection, and V is given 
by a V b := span{a, b}. The orthocomplement of a closed subset is the orthogonal 
complement in Hilbert space. For comprehensive discussions see |34| and [5J. 

One of the main questions in the foundations of Quantum Mechanics is whether 
one could understand (derive) the usage of Hilbert spaces and complex numbers from 
somehow more fundamental principles. Even though it is not a priori clear what ones 
measure of fundamentality should be at this point, an interesting line of attack consists 
in deriving the mentioned structures from the properties of the lattice of propositions 
(Quantum Logic). It can be shown that a lattice that is complete, atomic, irreducible, 
orthomodular, and that satisfies the covering property is isomorphic to the lattice of 
closed subspaces of a linear space with Hermitean inner product. The complex num- 
bers are selected if additional technical assumptions are added. For the precise state- 
ments of these reconstruction theorems see 0. 

It is now interesting to note that, on a formal level, there is a similar transition in 
going from Galilei invariant to Lorentz invariant causality relations. In fact, in Galilean 
spacetime one can also define a chronological complement: Two points are chronolog- 
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ically related if they are connected by a worldline of finite speed and, accordingly, two 
subsets in spacetime are chronologically disjoint if no point in one set is chronologi- 
cally related to a point of the other. For example, the chronological complement of a 
point p are all points simultaneous to, but different from, p. More general, it is not hard 
to see that the chronologically complete sets are just the subsets of some t = const, 
hypersurface. The lattice of chronologically complete sets is then the continuous dis- 
joint union of sublattices, each of which is isomorphic to the Boolean lattice of subsets 
in R^. For details see 1 14|. 

As we have seen above, Chron(M"^) is complete, atomic, irreducible, and ortho- 
modular (hence atomistic). The main difference to the lattice of propositions in Quan- 
tum Mechanics, as regards the formal aspects discussed here, is that Chron(M^) does 
not satisfy the covering property. Otherwise the formal similarities are intriguing and 
it is tempting to ask whether there is a deeper meaning to this. In this respect it would 
be interesting to know whether one could give a lattice-theoretic characterization for 
Chron(M) (M some fixed spacetime), comparable to the characterization of the lat- 
tices of closed subspaces in Hilbert space alluded to above. Even for M = M"^ such a 
characterization seems, as far as I am aware, not to be known. 

5.5 Rigid motion 

As is well known, the notion of a rigid body, which proves so useful in Newtonian 
mechanics, is incompatible with the existence of a universal finite upper bound for 
all signal velocities f37'|. As a result, the notion of a perfectly rigid body does not 
exist within the framework of SR. However, the notion of a rigid motion does exist. 
Intuitively speaking, a body moves rigidly if, locally, the relative spatial distances of 
its material constituents are unchanging. 

The motion of an extended body is described by a normalized timelike vector field 
u : n — ) W^, where Q. is an open subset of Minkowski space, consisting of the events 
where the material body in question 'exists'. We write g(u, u) = u • u = for the 
Minkowskian scalar product. Being normalized now means that = (we do not 
choose units such that c = 1 ). The Lie derivative with respect to u is denoted by L^^. 

For each material part of the body in motion its local rest space at the event p G O 
can be identified with the hyperplane through p orthogonal to Up: 

Hp:=p + u^. (121) 

Up carries a Euclidean inner product, hp, given by the restriction of — g to Up . Gen- 
erally we can write 

h = c^^u'' ®u'' - g , (122) 

where u'' = g^(u) := g(u, •) is the one-form associated ('index-lowered', cf. 
Sec. lA.5t to u. Following L12J the precise definition of 'rigid motion' can now be 
given as follows: 

Definition 7 (Born 1909). Let u be a normalized timelike vector field u. The motion 
described by its flow is rigid if 

Uh = . (123) 
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Note that, in contrast to the Killing equations L^g = 0, these equations are non linear 
due to the dependence of h, upon u. 

We write TTh := id — c^^u^u'' G End(M"^) for the tensor field over spacetime that 
pointwise projects vectors perpendicular to u. It acts on one forms a via TThIcx) := 
a o TTh and accordingly on all tensors. The so extended projection map will still be 
denoted by TTh- Then we e.g. have 

h = -nHg:=-g(nH-,nH-). (124) 
It is not difficult to derive the following two equations:^^ 

UJx = fUh, (125) 
Uh = -U(nHg) = -nH(Ug) , (126) 

where f is any differentiable real- valued function on O. 

Equation (11251) shows that the normalized vector field u satisfies (I123t iff any 
rescaling fu with a nowhere vanishing function f does. Hence the normalization con- 
dition for u in (I123t is really irrelevant. It is the geometry in spacetime of the flow 
lines and not their parameterization which decide on whether motions (all, i.e. for any 
parameterization, or none) along them are rigid. This has be the case because, gener- 
ally speaking, there is no distinguished family of sections (hypersurfaces) across the 
bundle of flow lines that would represent 'the body in space', i.e. mutually simultane- 
ous locations of the body's points. Distinguished cases are those exceptional ones in 
which u is hypersurface orthogonal. Then the intersection of u's flow lines with the 
orthogonal hypersurfaces consist of mutually Einstein synchronous locations of the 
points of the body. An example is discussed below. 

Equation (I126t shows that the rigidity condition is equivalent to the 'spatially' 
projected Killing equation. We call the flow of the timelike normalized vector field 
u a Killing motion (i.e. a spacetime isometry) if there is a Killing field K such that 
u = cK/\/K^. Equation (11261) immediately implies that Killing motions are rigid. 
What about the converse? Are there rigid motions that are not Killing? This turns out 
to be a difficult question. Its answer in Minkowski space is: 'yes, many, but not as 
many as naively expected.' 

Before we explain this, let us give an illustrative example for a Killing motion, 
namely that generated by the boost Killing-field in Minkowski space. We suppress 
all but one spatial directions and consider boosts in x direction in two-dimensional 
Minkowski space (coordinates ct and x; metric ds^ = c^dt^ — dx^). The Killing field 

is27 

K = x9ct + ct9^, (127) 

which is timelike in the region |x| > |ct|. We focus on the 'right wedge' x > |ct|, which 
is now our region O. Consider a rod of length I which at t = is represented by the 

Equation <126> simply follows from LuTTh = -c^^u ® LuU^ so that g((LunH)X, riHY) = 
for all X,Y. In fact, LuU^ = where a := VuU is the spacetime-acceleration. This follows 
fromLuU^{X) = Lu(g(u, X)) - g(u, UX) = g(VuU, X) + g{u, VuX - [u, X]) = g(Q,X)- 
g(u, Vxu) = g(a, X), where g(u, u) = const, was used in the last step. 

Here we adopt the standard notation from differential geometry, where 9^ := 9/9x^ denote the vector 
fields naturally defined by the coordinates {x^}|j.=o - n-i • Pointwise the dual basis to {9n}^=o - n-i 
is {dx^^jV=o...n-i . 



42 



interval x G (r, r + 1), where r > 0. The flow of the normalized field u = cK/\/K^ is 

ct(T) = xo sinh(cT/xo) , (128a) 
x(t) = Xo cosh(cT/xo) , (128b) 

where xq = x(t = 0) G (t, r + I] labels the elements of the rod at t = 0. We have 
x^ — c^t^ = Xq, showing that the individual elements of the rod move on hyperbolae 
('hyperbolic motion'), t is the proper time along each orbit, normalized so that the rod 
lies on the x axis at t = 0. 
The combination 

A := ct/xo (129) 

is just the flow parameter for K (I127t . sometimes referred to as 'Killing time' (though 
it is dimensionless). From (I128t we can solve for A and t as functions of ct and x: 

A = f(ct,x) := tanh"^(ct/x) , (130a) 
T = f(ct,x] := ^(x/c)2 - t2 tanh-^ (ct/x) , (130b) 

xo/c 

from which we infer that the hypersurfaces of constant A are hyperplanes which all 
intersect at the origin. Moreover, we also have df = K^/K^ (d is just the ordinary 
exterior differential) so that the hyperplanes of constant A intersect all orbits of u 
(and K) orthogonally. Hence the hyperplanes of constant A qualify as the equivalence 
classes of mutually Einstein-simultaneous events in the region x > |ct| for a family of 
observers moving along the Killing orbits. This does not hold for the hypersurfaces of 
constant t, which are curved. 

The modulus of the spacetime-acceleration (which is the same as the modulus of 
the spatial acceleration measured in the local rest frame) of the material part of the rod 
labeled by xo is 

||a||g = cVxo. (131) 

As an aside we generally infer from this that, given a timelike curve of local acceler- 
ation (modulus) a, infinitesimally nearby orthogonal hyperplanes intersect at a spatial 
distance c^/a. This remark will become relevant in the discussion of part 2 of the 
Noether-Herglotz theorem given below. 

In order to accelerate the rod to the uniform velocity v without deforming it, its 
material point labeled by xq has to accelerate for the eigentime (this follows from 

(insj) 

T = — tanh""" (v/c) , (132) 
c 

which depends on xq. In contrast, the Killing time is the same for all material points 
and just given by the final rapidity. In particular, judged from the local observers 
moving with the rod, a rigid acceleration requires accelerating the rod's trailing end 
harder but shorter than pulling its leading end. 

In terms of the coordinates (A,xo), which are comoving with the flow of K, and 
(t, Xq), which are comoving with the flow of u, we just have K = 9/9A and u = 9/9t 
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respectively. The spacetime metric g and the projected metric h. in terms of these 
coordinates are: 

h, = dxg , (133a) 
g = Xq dA^ — dxg = c^(dT — (t/xq) dxo)^ — dxo . (133b) 

Note the simple form g takes in terms of xq and A, which are also called the 'Rindler 
coordinates' for the region |x| > |ct| of Minkowski space. They are the analogs in 
Lorentzian geometry to polar coordinates (radius xq, angle A) in Euclidean geometry. 

Let us now return to the general case. We decompose the derivative of the velocity 
one-form u'' := g-^(u) as follows: 

Vu^ = e + cu + c"^u^® a^ (134) 

where 9 and cu are the projected symmetrized and antisymmetrized derivatives respec- 
tively^^ 

29 = nH(VVu^) = VVu^-c^^u^ Va^ (135a) 
2cu = nH(VAu^) = VAu^-c^^u^ Aa^ (135b) 

The symmetric part, 9, is usually further decomposed into its traceless and pure trace 
part, called the shear and expansion of u respectively. The antisymmetric part cu is 
called the vorticity of u. 

Now recall that the Lie derivative of g is just twice the symmetrized derivative, 
which in our notation reads: 

L^g = VVu^ (136) 
This implies in view of (I123t . (I126t . and (I135al) 

Proposition 14. Let ube a normalized timelike vector field u. The motion described 
by its flow is rigid iffxi is of vanishing shear and expansion, i.e. iffQ =0. 

Vector fields generating rigid motions are now classified according to whether or 
not they have a vanishing vorticity cu: if cu = the flow is called irrotational, other- 
wise rotational. The following theorem is due to Herglotz l30ll and Noether l38]l : 

Theorem 15 (Noether & Herglotz, part 1). A rotational rigid motion in Minkowski 
space must be a Killing motion. 

An example of such a rotational motion is given by the Killing field^^ 

K = 9t+K9^, (137) 

We denote the symmetrized and antisymmetrized tensor- product (not including the factor 1/n!) by V 
and A respectively and the symmetrized and antisymmetrized (covariant-) derivative by VV and VA. 
For example, (u^ A v^)ab = UaVh — UbVa and ( VV u^)ab = VaUb -h VbUa. Note that (V Au'') 
is the same as the ordinary exterior differential du^ . Everything we say in the sequel applies to curved 
spacetimes if V is read as covariant derivative with respect to the Levi-Civita connection. 
We now use standard cylindrical coordinates (z, p, cp), in terms of which ds^ = c^dt^ — dz^ — 
dp^ - p^ dcp^ 
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inside the region 



a = {(t,z,p,(p) I Kp < c}, 



(138) 



where K is timelike. This motion corresponds to a rigid rotation with constant an- 
gular velocity k which, without loss of generality, we take to be positive. Using the 
comoving angular coordinate i|j := cp — Kt, the split (11221) is now furnished by 

= cyW^^/^|cdt-:p-^^^0-p pdil^l , (139a) 

h = dz^ + dp2 + ^ . (139b) 

The metric h, is curved (cf. LemmafT6l. But the rigidity condition (I123t means that 

h, and hence its curvature, cannot change along the motion. Therefore, even though 
we can keep a body in uniform rigid rotational motion, we cannot put it into this state 
from rest by purely rigid motions, since this would imply a transition from a flat to a 
curved geometry of the body. This was first pointed out by Ehrenfest 1 17 1. Below we 
will give a concise analytical expression of this fact (cf. equation (I143t ). All this is in 
contrast to the translational motion, as we will also see below. 

The proof of TheoremfTSl relies on arguments from differential geometry proper 
and is somewhat tricky. Here we present the essential steps, basically following 1 391 
and 103 1 in a slightly modernized notation. Some straightforward calculational details 
will be skipped. The argument itself is best broken down into several lemmas. 

At the heart of the proof lies the following general construction: Let M be the 
spacetime manifold with metric g and O C M the open region in which the normal- 
ized vector field u is defined. We take D. to be simply connected. The orbits of u 
foliate O and hence define an equivalence relation on O given by p ~ q iff p and q lie 
on the same orbit. The quotient space O := 0/~ is itself a manifold. Tensor fields on 
tl can be represented by (i.e. are in bijective correspondence to) tensor fields T on O 
which obey the two conditions: 

nHT = T, (140a) 
LuT = . (140b) 

Tensor fields satisfying (I140at are called horizontal, those satisfying both conditions 
(I140t are called projectable. The (n — 1 ) -dimensional metric tensor h, defined in 
(I122t . is an example of a projectable tensor if u generates a rigid motion, as assumed 
here. It turns [Cl^K) into a (n — 1 ) -dimensional Riemannian manifold. The covariant 
derivative ^ with respect to the Levi-Civita connection of h is given by the following 
operation on projectable tensor fields: 

'^:=nhoV (141) 

i. e. by first taking the covariant derivative V (Levi-Civita connection in (M, g)) in 
spacetime and then projecting the result horizontally. This results again in a projectable 
tensor, as a straightforward calculation shows. 

The horizontal projection of the spacetime curvature tensor can now be related to 
the curvature tensor of Cl (which is a projectable tensor field). Without proof we state 
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Lemma 16. Let u generate a rigid motion in spacetime. Then the horizontal projection 
of the totally covariant (i.e. all indices down) curvature tensor R o/ (O, g) is related 
to the totally covariant curvature tensor ft of [D., h) by the following equatiorr''^ : 

nhR = -ft-3(id-nA)cu0cu, (142) 

where Ha is the total antisymmetrizer, which here projects tensors of rank four onto 
their totally antisymmetric part. 

Formula (11421) is true in any spacetime dimension n. Note that the projector (id — Ha) 
guarantees consistency with the first Bianchi identities for R and ft, which state that the 
total antisymmetrization in their last three slots vanish identically. This is consistent 
with (11421) since for tensors of rank four with the symmetries of cu cu the total 
antisymmetrization on tree slots is identical to TTa, the symmetrization on all four 
slots. The claim now simply follows from TTa ° (id — TTa) = Ha — TTa = 0. 

We now restrict to spacetime dimensions of four or less, i.e. n < 4. In this case 
Ha ° Hh = since TTh makes the tensor effectively live over n — 1 dimensions, 
and any totally antisymmetric four-tensor in three or less dimensions must vanish. 
Applied to (11421) this means that TTa(uj (8) tu) = 0, for horizontality of cu implies 
cu cu = TTh(cu (g) cu). Hence the right hand side of (11421) just contains the pure tensor 
product —3 cu (g) cu. 

Now, in our case R = since (M, g) is flat Minkowski space. This has two inter- 
esting consequences: First, [Cl, h) is curved iff the motion is rotational, as exemplified 
above. Second, since ft is projectable, its Lie derivative with respect to u vanishes. 
Hence (11421) implies L^^cu cu + cu (g) L^^cu = 0, which is equivalent to^^ 

Ucu = . (143) 

This says that the vorticity cannot change along a rigid motion in flat space. It is the 
precise expression for the remark above that you cannot rigidly set a disk into rotation. 
Note that it also provides the justification for the global classification of rigid motions 
into rotational and irrotational ones. 

A sharp and useful criterion for whether a rigid motion is Killing or not is given 
by the following 

Lemma 17. Let u be a normalized timelike vector field on a region O C M. The 
motion generated by u is Killing iff it is rigid and 0? is exact on O. 

Proof. That the motion generated by u be Killing is equivalent to the existence of a 
positive function f : D — ) R such that Lf^^g = 0, i.e. V V (fu'') = 0. In view of (I135al) 
this is equivalent to 

29 + (dlnf + c"^a^) Vu^ = 0, (144) 

which, in turn, is equivalent to 6 = and a** = — c^dlnf. This is true since 9 is 
horizontal, TTh9 = 9, whereas the first term in (11441) vanishes upon applying TTh- The 

6 appears with a minus sign on the right hand side of <142t because the first index on the hatted 
curvature tensor is lowered with h. rather than g. This induces a minus sign due to <122t . i.e. as a result 
of our 'mostly-minus'-convention for the signature of the spacetime metric. 
" In more than four spacetime dimensions one only gets (id — Ha )(Lua)®cu + a)(8)LuCu) — 0. 
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result now follows from reading this equivalence both ways: 1) The Killing condition 
for K := fu implies rigidity for u and exactness of 2) Rigidity of u and a'' = — dO 
imply that K := fu is Killing, where f := expfO/c^]. □ 

We now return to the condition (I143t and express L^cu in terms of du^ For this 
we recall that Li^u'' = a'' (cf. footnotel26l and that Lie derivatives on forms commute 
with exterior derivatives^^. Hence we have 

2 L^cu = Lutnndu^ ) = Uhda^ = da^ ~ c^V A L^a^ . (145) 

Here we used the fact that the additional terms that result from the Lie derivative of 
the projection tensor TTh vanish, as a short calculation shows, and also that on forms 
the projection tensor TTh can be written as TTh = id — c^^v!' A i^, where denotes 
the map of insertion of u in the first slot. 
Now we prove 

Lemma 18. Let u generate a rigid motion in flat space such that cu 7^ 0, then 

Ua^ = . (146) 

Proof. Equation (I143t says that cu is projectable (it is horizontal by definition). Hence 
^tu is projectable, which implies 

^^0^ = 0. (147) 

Using (I134t with 9 = one has 

^cu = UhVcu = HhWu'' - c^^HhIVu'' ® a^] . (148) 

Antisymmetrization in the first two tensor slots makes the first term on the right van- 
ish due to the flatness on V. The antisymmetrized right hand side is hence equal to 
— c^^tu ® . Taking the Lie derivative of both sides makes the left hand side vanish 
due to (11471) . so that 

U(tu®a^) = cu(g)Ua'' =0 (149) 
where we also used (I143t . So we see that L^^a'' = if cu 7^ 0.^^ □ 

The last three lemmas now constitute a proof for TheoremfTSl Indeed, using (I146t 
in (I145t together with (11431) shows da'' = 0, which, according to LemmafTTl implies 
that the motion is Killing. 

Next we turn to the second part of the theorem of Noether and Herglotz, which 
reads as follows: 

Theorem 19 (Noether & Herglotz, part 2). All irrotational rigid motions in 
Minkowski space are given by the following construction: take a twice continuously 
differentiable curve T h-> z(t) in Minkowski space, where w.l.o.g T is the eigentime, so 
that = c^. Let := z(t) + (z(t))^ be the hyperplane through z[x) intersecting 
the curve z perpendicularly. Let O. be a the tubular neighborhood of z in which no 
two hyperplanes H^, H^:' intersect for any pair z(t], z(t'] of points on the curve. In 
D. define u as the unique (once differentiable) normalized timelike vector field perpen- 
dicular to all Ht: n D.. Thefiow ofu is the sought-for rigid motion. 

This is most easily seen by recalling that on forms the Lie derivative can be written as Lu = d o + 

iu o d, where tu is the map of inserting u in the first slot. 

We will see below that <146t is generally not true if cu = 0; see equation jl58> . 
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Proof. We first show that the flow so defined is indeed rigid, even though this is more 
or less obvious from its very definition, since we just defined it by 'rigidly' moving a 
hyperplane through spacetime. In any case, analytically we have, 

= {x G I f (t, x) := z(t) • (x - z(t)) = 0} . (150) 

In n any x lies on exactly one such hyperplane, 14^, which means that there is a 
function ct : O — > M so that t = a(x) and hence F(x) := f (a(x), x) = 0. This implies 
dF = 0. Using the expression for f from (I150t this is equivalent to 

da = o a/[c^ - (z o ct) • (id - z o cr)] , (151) 

where 'id' denotes the 'identity vector-field', x i— > x*^9^, in Minkowski space. Note 
that in O we certainly have (t, ^) 7^ and hence z • (x — z) ^ c^. In O we now 
define the normalized timelike vector field^^ 

u:=zoo-. (152) 

Using (I151t . its derivative is given by 

Vu^ = da «) (z^ o a) = [(z^ o a) ^ (z^ o a)] /(N V) , (153) 

where 

N := 1 - (zo a) • (id-zo a)/c^. (154) 

This immediately shows that TThVu'' = (since n^z'' = 0) and therefore that 9 = 
tu = 0. Hence u, as defined in (I152t . generates an irrotational rigid motion. 

For the converse we need to prove that any irrotational rigid motion is obtained 
by such a construction. So suppose u is a normalized timelike vector field such that 
9 = tu = 0. Vanishing cu means TThIV A u'') = TTnldu''] = 0. This is equivalent 
to u'' A du'' = 0, which according to the Frobenius theorem in differential geometry 
is equivalent to the integrability of the distribution^^ u'' = 0, i.e. the hypersurface 
orthogonality of u. We wish to show that the hypersurfaces orthogonal to u are hy- 
perplanes. To this end consider a spacelike curve z(s), where s is the proper length, 
running within one hypersurface perpendicular to u. The component of its second 
s-derivative parallel to the hypersurface is given by (to save notation we now simply 
write u and u'' instead of u o z and u'' o z) 

TThz = z — c^^uu''(z) = z + c^^u9(z, z) = z , (155) 

where we made a partial differentiation in the second step and then used 9 = 0. 
Geodesies in the hypersurface are curves whose second derivative with respect to 
proper length have vanishing components parallel to the hypersurface. Now, (I155t 
implies that geodesies in the hypersurface are geodesies in Minkowski space (the hy- 
persurface is 'totally geodesic'), i.e. given by straight lines. Hence the hypersurfaces 
are hyperplanes. □ 

^* Note that, by definition of cr, (z o cr) • (id — z o ct) = 0. 

'Distribution' is here used in the differential-geometric sense, where for a manifold M it denotes an 
assignment of a linear subspace Vp in the tangent space Tp M. to each point p of M. The distribution 
u'' = is defined by Vp = {v G TpM. | Up(v) = Up ■ v = 0}. A distribution is called (locally) 
integrable if (in the neighborhood of each point) there is a submanifold M ' of M whose tangent space 
at any p G M ' is just Vp . 
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Theorem^] precisely corresponds to the Newtonian counterpart: The irrotational 
motion of a rigid body is determined by the worldhne of any of its points, and any 
timelike worldline determines such a motion. We can rigidly put an extended body 
into any state of translational motion, as long as the size of the body is limited by c^/ cx, 
where a is the modulus of its acceleration. This also shows that (I146t is generally not 
valid for irrotational rigid motions. In fact, the acceleration one-form field for (I152t is 

= (z^ o a)/N (156) 

from which one easily computes 
da^ = (z^cr)Al(nHZ^ o a) + (z^ o ^) ° o") • (id-zoa) | _ 

From this one sees, for example, that for constant acceleration, defined by U^z = 
(constant acceleration in time as measured in the instantaneous rest frame), we have 
da'' = and hence a Killing motion. Clearly, this is just the motion (I128t for the boost 
Killing field (I127t . The Lie derivative of a'' is now easily obtained: 

Ua^ = i^da^ = (U^z^ o o-jN"^ , (158) 

showing explicitly that it is not zero except for motions of constant acceleration, which 
were just seen to be Killing motions. 

In contrast to the irrotational case just discussed, we have seen that we cannot put 
a body rigidly into rotational motion. In the old days this was sometimes expressed 
by saying that the rigid body in SR has only three instead of six degrees of freedom. 
This was clearly thought to be paradoxical as long as one assumed that the notion of 
a perfectly rigid body should also make sense in the framework of SR. However, this 
hope was soon realized to be physically untenable ll37l . 

5.6 Geometry of space and time in rotating reference frames 

We have seen above that there is a generalization of Einstein simultaneity for the case 
of rigid linear accelerations. The hypersurfaces of simultaneity were given by the 
hyperplanes of constant Killing time A, which are different from the (curved) hyper- 
surfaces of constant proper time t. This worked because the Killing field was (locally) 
hypersurface orthogonal. 

Note that in terms of the co-rotating coordinates (ct,z, p,4)) (recall that i|j = 
cp — Kt) the Killing field (I137t is just K = 9t. It is convenient to rewrite the spacetime 
metric g = c^^u'' (g) — h in the following form 

g = exp(20/c^) A® A-h, (159) 

where h is given by ( I139bt and, using ( I139al ). we have the following expressions for 
O and A: 

O := Inj^KVc^l = 4 In {l - (kp/c)^} , (160a) 

A := kVk^ = dt - ^ . (160b) 

1 - Kp/cK 
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The physical interpretation of O appears from calculating the acceleration a that an 
observer experiences who moves along the Killing orbit: 

:= v,,u^ = -dO . (161) 

Hence O is the Newtonian potential that is accelerating the Killing observer 

The rotational Killing field is clearly not hypersurface orthogonal. The obstruction 
is just given by the vorticity tu. A simple calculation gives 

F := dA = 2c"^ expl-O/c^) cu . (162) 

Hence the obstruction for hypersurface orthogonality is likewise faithfully measured 
by A. Moreover, as we shall see below, the 1-form A has an interesting physical and 
geometric interpretation, which is the actual reason why we introduced it here. 

Inside the region O (defined in (I138t ) K is a complete and nowhere vanishing 
timelike vector field. This means that the flow f:MxO— jOofK defines a free action 
of the additive group M on O that makes D. the total space of a principle bundle with 
fiber M and base ii = 0/~. Here ~ is again the equivalence relation which declares 
two points in D. to be equivalent iff they lie on the same K orbit. Hence O, which is 
obviously diffeomorphic to the solid cylinder {(z, p, cp) | p < c/k), is the space of 
K orbits. Since D. is endowed with the metric g and since K acts by isometrics, the 
distribution of hyperplanes (I12U orthogonal to the Killing orbits define a connection 
on the principal bundle whose corresponding 1-form is just A.^^ Accordingly, the 
bundle curvature is given by F = dA. Note that F can be considered as 2-form on ii 
since ikF = and LkF = ixdF = 0.^^ 

Now, parallel transport defined by the connection A has a direct physical interpre- 
tation: it is just transportation of time according to Einstein synchronization. Since 
F / this transportation is not path independent. In particular this implies that syn- 
chronization along fixed paths is not a transitive operation anymore. Given two points 
in ii connected by a spatial path ^ in ii, parallel transportation along ")> requires that 
we lift ^ to a path y in O whose tangent vectors are annihilated by A, that is, which 
runs orthogonally to the orbits of K. But this is just what we mean by saying that 
the points on the curve y are locally Einstein synchronized, in the sense that any two 
infinitesimally nearby points on y are Einstein synchronized. Hence the integral of A 
along y vanishes. Using (I160bt this is equivalent to 



At; 



K 

dt = -T 

-y ^ 



^ , (163) 



1 - (kp/c 



where we interpreted p and '\\) as coordinates on ii so that the right hand side could be 
written as integral along the curve i> in O. This means that if we Einstein synchronize 
clocks along in space, the clock at the final point of i> shows a lapse At of coordinate- 
time as compared to the clock at the initial point of A striking consequence of the 

The connection 1-form A associated to tiie distribution of 'horizontal' subspaces has to fulfill two 

conditions: 1) vectors tangential to the horizontal subspaces are annihilated by A and 2) A(K) = 1 , 

where K is a 'fundamental vector field' which generates the action of the structure group R. Both 

conditions are satisfied in our case. See e.g. |9 1 for a lucid discussion of these notions. 

More precisely, there is a unique 2-form f on O such that 7t*1- = F, where 7t : O — > O is the bundle 

projection. 
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non-transitivity of Einstein synchronization is the non-zero lapse of coordinate time 
that one obtains for spatially closed curves. These lapses are just the holonomies of 
the connection A. If, for simplicity, we choose "}> to be a closed planar loop of constant 
p and z, (I163t immediately leads to 

At='^J^flf-,^i2./c^)S, (164) 

K 1 — [Kp/Cj"^ 

where S is explained below. The lapse in proper time. At, is obtained by multiplying 
this result with expfO/c^), which merely amounts to replacing the denominator 1 — 
(Kp/c)^ in (I164t with its square root. This time lapse is directly related to the Sagnac 
effect. In fact, the observed phase shift in the Sagnac effect is obtained by multiplying 
the expression for the time lapse with twice^** the light's frequency -y. 

In (I164t S denotes the area of the 2-disk spanned by the planar loop. Note that this 
area is only approximately given by 7tp since the geometry in O, determined with co- 
rotating rods and clocks, is given by the metric h; see (I139bt . The precise expressions 
for the circumference, C and area, S, of the planar loop of constant p follow from 
(fT39bl : 



rln 



c 



dii> p 2np 



/T^Ikp/cP /T^Ikp/cF 
^^"fp dil^dp'p' 2nc^ 



> 2np, (165a) 



/T^TkpVcP 



k2 



^ -^/^- (kp/c)2 *> > 7Tp^ (165b) 



The circumference grows faster than oc p and the area faster than oc p^. Note that 
according to ( I139bt p is the geodesic radial distance. Hence the two-dimensional 
hypersurfaces of constant z in 12 are negatively curved. In fact, the Gaussian curvature, 
/C, of these hypersurfaces turns out to be 

IC = , , (166) 

{1-(KP/C)2}^ 

which is strictly negative, approximately constant for p <C c/k, and unbounded as 
p approaches the critical radius c/k. In contrast, according to ( I139bl ). the metrics 
induced by h, on the hypersurfaces of constant i|; are flat. 

The bundle curvature of F for the connection A and the Riemannian curvature for 
[Cl,h) are indeed intimately linked through identities which arise by calculating the 
Riemannian curvature of (O, g), where g is parameterized as in (I159t . and noting that 
g is flat (Minkowski metric). One such identity is the so called Kaluza-Klein identity, 
which expresses the scalar curvature (Ricci scalar) of g in terms of the scalar curvature 
Rh of h, O, and ||F||^ = H^h'^F^jFi^i^. Since the scalar curvature of g is zero, one 
obtains: 

Rh = 2exp(-(D/c2)AHexp(0/c2) - \ c^exptlO/c^) ||F||^ , (167) 

where Ah is the Laplace operator on (Q, h). 

The factor 2 results simply from the fact that the Sagnac effect measures the sum of the moduli of time 
lapses for a closed curve traversed in both directions. 
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It is an interesting historical fact that it was Kaluza who pointed out that 'space' 
in rotating reference frames cannot be identified with a submanifold perpendicular to 
the Killing orbits (because such a submanifold does not exist) but rather has to be 
constructed as the quotient manifold tl which carries the curved metric h |35|. He 
also discussed the non-integrability of Einstein synchronization. This he did in 1910, 
ten years before he applied the very same mathematical ideas to the five-dimensional 
setting known as Kaluza-Klein theories. 
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A Appendices 



For the interested reader this appendix collects some mathematical background mate- 
rial related to various points discussed in the main text. 

A.l Sets and group actions 

Given a set S, recall that an equivalence relation is a subset R C S x S such that for 
all p, q,r G S the following conditions hold: 1) (p,p) G R (called 'reflexivity'), 2) if 
(p, q) G R then (q,p) G R (called 'symmetry'), and 3) if (p, q) G R and (q,r) G R 
then (p,r) G R (called 'transitivity'). Once R is given, one often conveniently writes 
p ~ q instead of (p, q) G R. Given p G S, its equivalence class, [p] C S, is given 
by all points R-related to p, i.e. [p] := {q G S | (p,q) G R}. One easily shows 
that equivalence classes are either identical or disjoint. Hence they form a partition of 
S, that is, a covering by mutually disjoint subsets. Conversely, given a partition of a 
set S, it defines an equivalence relation by declaring two points as related iff they are 
members of the same cover set. Hence there is a bijective correspondence between 
partitions of and equivalence relations on a set S. The set of equivalence classes is 
denoted by S/R or S/~. There is a natural surjection S — > S/R, p i— > [p]. 

If in the definition of equivalence relation we exchange symmetry for antisymme- 
try, i.e. (p,q) G R and (q,p) G R implies p = q, the relation is called a partial 
order, usually written as p > q for (p , q ) G R. If, instead, reflexivity is dropped and 
symmetry is replaced by asymmetry, i.e. (p, q) G R implies (q,p) R, one obtains a 
relation called a strict partial order, usually denoted by p > q for (p , q ) G R. 

An left action of a group G on a set S is a map ([) : G x S — > S, such that 4)(e, s) = s 
(e = group identity) and 4)(gh, s) = 4)(g, 4>(h,, s)). If instead of the latter equation 
we have (|)(gH, s) = 4)(H, (|)(g, s)) one speaks of a right action. For left actions one 
sometimes conveniently writes ct)(g,s) =: g • s, for right actions 4)(g,s) =: s • g. 
An action is called transitive if for every pair (s, s') G S x S there is a g G G such 
that 4)(g,s) = s', and simply transitive if, in addition, (s,s') determine g uniquely, 
that is, 4)(g, s) = (^[g',s) for some s imphes g = g'. The action is called effective 
if 4)(g, s) = s for all s implies g = e ('every g ^ e moves something') and free if 
(t)(g, s) = s for some s implies g = e ('no g 7^ e has a fixed point'). It is obvious 
that simple transitivity implies freeness and that, conversely, freeness and transitivity 
implies simple transitivity. Moreover, for Abelian groups, effectivity and transitivity 
suffice to imply simple transitivity. Indeed, suppose g • s = g ' • s holds for some s G S, 
then we also have k- (g • s) = k - (g' • s) for all k G G and hence g • (k- s) = g' • (k- s) 
by commutativity. This implies that g • s = g' • s holds, in fact, for all s. 

For any s G S we can consider the stabilizer subgroup 

Stab(s) :={g G G I cj)(g,s) =s}C G. (168) 

If cj) is transitive, any two stabilizer subgroups are conjugate: Stab(g • s) = 
gStab(s)g^^ By definition, if <^ is free all stabilizer subgroups are trivial (consist 
of the identity element only). In general, the intersection G' := Plses Stab(s) C G is 
the normal subgroup of elements acting trivially on S. If (|) is an action of G on S, then 
there is an effective action $ of 6 := G/G' on S, defined by $([g],s) := 4)(g,s), 
where [g] denotes the G'-coset of G' in G. 
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The orbit of s in S under the action cj) of G is the subset 

Orb(s):={4)(g,s)|gGG}CS. (169) 

It is easy to see that group orbits are either disjoint or identical. Hence they define a 
partition of S, that is, an equivalence relation. 

A relation R on S is said to be invariant under the self map f : S — ) S if (p , q ) G 
R <^ (f(p),f(q)) G R. It is said to be invariant under the action cj) of G on S if 
(p,q) G R <4 ((})(g,p), c()(g, q)) G R for all g G G. If R is such a G-invariant 
equivalence relation, there is an action c|)' of G on the set S/R of equivalence classes, 
defined by 4)'(g, [p]) := [(|)(g,p)]. A general theorem states that invariant equivalence 
relations exist for transitive group actions, iff the stabilizer subgroups (which in the 
transitive case are all conjugate) are maximal (e.g. Theorem 1.12 in |33l). 

A.2 Structures on vector and affine spaces 
A.3 Non degenerate bilinear forms 

Consider a vector space V of dimension n over F (here denoting M or C). Let it be en- 
dowed with a non-degenerate bilinear form tu : V x V — > F. No assumptions regarding 
symmetries of tu are made at this point. The dual space of V is denoted by V* whose 
elements we will denote by Greek letters. The set of linear maps V — > V is denoted 
by End(V), called the endomorphisms of V, which forms an associative algebra over 
F (algebra multiplication being composition of maps). The set of invertible elements 
in End(V) (i.e. isomorphisms of V) will be denoted by GL(V); it forms a group under 
composition. Generally, composition of maps will be denoted by o. 
The form tu defines an isomorphism 

tu^:V^V*, tu^(v) := tu(v,-) , (170) 

with inverse map being denoted by 

tuT:V*^V, tu^:=(tu^)-\ (171) 

so that 

tu^ o tu^ = idv and tu-^ o tu^ = idy . (172) 

Recall that 'transposition' is a map End(V) — > End(V*), denoted by A i— > A^ and 
defined through A^(a) := ao A. This map is an anti-isomorphism of algebras ('anti', 
since it obeys (A o B)^ = o A^). Different from this canonically defined notion of 
transposition is the 'tu-transposition', which is an isomorphism End(V) — > End(V), 
which we denote by A^ (the dependence on tu being implicitly understood) and which 
is defined through 

tu(AVv] = tu(u, Av] Vu,vgV. (173) 

Note that the tu-transposed is in End(V) whereas the canonical transposed is in 
End(V*]. The relations between the two are 

A^ = tu^ o A^ o tu^ and A^ = tu^- o A^ o tu^ . (174) 



54 



A.4 Generalized orthogonal transformations 

A generalized orthogonal transformation of (V, w) is any bijective map c}) : V — > V 
such that cu(ct)(u), (|)(v)) = cu(u,v) for all u,v € V. In this subsection we shall 
restrict to symmetric w. Note that any symmetric bilinear form cu is uniquely deter- 
mined by its quadratic form, i.e. the function d) : V — > F, v h-> (x)[v) := w[v,v), for 
we have tu(u,v) = ^((^^(u + v) — 6^[^x] — (i)[v)). It is sometimes useful to consider 
generalizations of distance measures by setting d(u,v) := y|^(u — v)l- This is e.g. 
done in SR, where one speaks of timelike and spacelike distances in that sense. Now 
suppose (p is an isometry with respect to d, i.e. d((p(u), cp(v)) = d(u, v) for all u, v. 
Consider c}) defined by c|)(u) := (p(u) — (p(0). Clearly cp is an isometry of d if c}) is a 
generalized orthogonal transformation with respect to tu. Now, generahzed orthogonal 
transformations are necessarily linear: 

Proposition 20. Let w be a non-degenerate symmetric bilinear form on V and let 
4) : V — ) V a generalized orthogonal transformation with respect to cu. Then ^ is 
linear 

Proof. Consider I := a)(a(t)(u) + bc|)(v) — c|)(au + bv),w); surjectivity^^ allows 
to write w = ^[z], so that I = aa)(u, z) + buj(v,z) — cu(au + bv,z) = for all 
z G V. Hence the aforementioned expression for I is zero for all w G V, which by 
non-degeneracy of cu implies the linearity of c}). □ 

Particularly simple generalized orthogonal transformations are given by reflections 
on non-degenerate hyperplanes. To explain this, let v G V and V"*- := {w G V | 
cu(v, w) = 0} C V. is a linear subspace of co-dimension one, that is, a hyperplane. 
That it be non-degenerate means that cu|^± is non-degenerated, which is easily seen to 
be the case iff cu(v,v) / 0. The reflection at the non-degenerate hyperplane is the 
map 

Pv(x):=x-2v — 5-. (175) 

where for convenience we wrote u • v := cu(u, v) and := v • v. pv is easily seen 
to be an involutive (i.e. pv o = idy) generalized orthogonal transformation. If cj) is 
any other generalized orthogonal transformation, the following equivariance property 
holds: cj) o p.y o cj)^^ = Ptjj(v)- An important result is now given by 

Theorem 21 (Cartan, Dieudonne). Let the dimension of V be n. Any generalized 
orthogonal transformation o/(V, cu) is the composition of at most n reflections. 

Proof. Comprehensive proofs may be found in l33l or Q. Here we offer a proof of 
the weaker result, that any generalized orthogonal transformation is the composition 
of at most 2n — 1 reflections. So let cj) be generalized orthogonal and v G V so that 
7^ (which certainly exists). Let w = ct)(v), then (v + w)^ + (v — w]^ = 4v^ / 
so that w + V and w — v cannot simultaneously have zero squares. So let (v =F w)'^ 7^ 
(understood as alternatives), then Pv=fw(v] = ±w and pv^w(w] = ±v. Hence v is 
eigenvector with eigenvalue 1 of the generalized orthogonal transformation given by 

ct)' = (p^-°* if(v-w)VO, ^^^^^ 

PvoPv+wOcfj if(v-w]^ = 0. 



Note that we only use surjectivity here, so that the hypotheses for this result may be slightly reduced. 



55 



Consider now the generalized orthogonal transformation cf^'l^x on v-*- with induced 
bilinear form | , which is non-degenerated due to ^ 0. We now conclude by 
induction: At each dimension we need at most two reflections to reduce the problem 
by one dimension. After n — 1 steps we have reduced the problem to one dimension, 
where we need at most one more reflection. Hence we need at most 2(n — 1 ) + 1 = 
2n — 1 reflections which upon composition with c|) produce the identity. Here we use 
that any generalized orthogonal transformation in V"*- can be canonically extended to 
span{v} © by just letting it act trivially on span{v}. □ 

There are several useful applications of this result, most notably in the construction 
of the Spin groups. Other applications in SR are discussed in Ii50i . 

A.5 Index raising and lowering 

Let {ea}a=i,- - ,n be a basis of V and {ri'^ja^i, .. ,n its (canonical) dual basis of V*, 
which is defined by ri'^(eb) = 6^. Using o)^ and cu^ one can define the tu-duals of 
{Cq} and {ri"} respectively, given by 

Tia := cu^(ea) G V*, (177a) 
e"" := wHri'') e V , (177b) 

so that, writing Wah '■= (^a, ^b) and cv"-^ for the components of the inverse-transposed 
matrix (i.e. tUac'^^'^ = cucacu'^^ = 6^), 

r\a := u^^iea) = o^abTl^ , (178a) 
e"" := wH-q'') = cu^'^eb. (178b) 

Using components with respect to the canonical dual bases, so that v = v"-ea G V with 
tu^ (v) =: VaTi"' and a = 0CaT["' S V* with cu^(a) =: ot°-ea, one obtains the equivalent 
to (11781) in coordinates: 

Va = V^OJba , (179a) 
a"" = cu'^^ab. (179b) 

It should be clear from (I178t and ( I179t why the maps tu^ and cu^ are called 'index 
lowering' and 'index raising'. Often, if there is no ambiguity as to what structure w is 
used, the following notation is employed: Let v G V and cc G V*, then ujJ-(v) =: v'' G 
V* and cu'r(a) =: G V. 

Finally we remark on the choice of conventions. Comparing e.g. (I179at with 
( I179bl ) one notices that one sums over the first index on cu for lowering and over the 
second index for raising indices (on the bases (I178t it is just the other way round). This 
is a consequence of the following requirements: 1) raising and lowering of indices are 
mutually inverse operations, and 2) the matrix, {tu*^^}, used for raising indices is the 
transposed inverse of {Wab}- The rationale for the second condition is the requirement 
that lowering both indices on {cu"^} using {cuab} should reproduce {cuab} and raising 
both indices on {cUab} using {cu*^^} should reproduce {a)°^]. This enforces 2). 

Note again that so far no assumptions were made concerning the symmetries of 
tu. In the general case there are, in fact, two raising-lowering operations: One as 
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given above, the other by replacing (11701) with d)^(v) := w[-,v), i.e. v now being in 
the second rather than the first slot. For this second operation we have all formulae 
as above with {cUqi,} and {cu*^^} being replaced by the transposed matrices. In phys- 
ical applications w is either symmetric — like in case of the Minkowski metric — or 
antisymmetric — like for the 2-spinor metric (symplectic form). In those cases there 
is — up to sign in the second case — a unique pair of lowering and raising operations. 

A.6 Linear frames 

A basis f = {ea]a=^ ,tl of V can be viewed as a linear isomorphism (also denoted by 
f), f : F"^ — > V, given by f (v^ , • • • , v"^) = v^Ca- With this interpretation we call the 
basis f a linear frame. Any frame f induces an isomorphism of algebras End(F"^) — > 
End(V), given by A := f o A o f-i. If A = {A^}, then A^e^) = A^eb- 

The standard (linear) action cj) of GL(F"^) on F"^, (t)(A,x) := Ax, thereby translates 
in an f-dependent way to an action of GL(F"^) on V, defined by ct)^(A,v) := 
f o ct)(A,f"''(v)); that is, (A,v) i-) A^v = f(Ax), where f(x) = v. 

Let JTy denote the set of frames for V. The general linear group GL(F") acts 
transitively and freely on JTy from the right: 

GL(V) X JP-y^ J^v, (A,f) •A:=f oA. (180) 
Proper subgroups of GL(F"^) continue to act freely on JTy. 

A.7 Affine spaces 

An affine space over the vector space V is a set Aff(V) together with an effective 
and transitive action ^ of V, considered as Abelian group (group multiplication being 
vector addition). Since the group is Abelian, this suffices to imply that the action is 
free and simply transitive. One writes 4)(ra,v) =: m + v, which defines what is meant 
by '+' between an element of Aff (V) and an element of V. Any ordered pair of points 
(p , q ) € Aff ( V) X Aff ( V) uniquely defines a vector v, namely that for which p = q +v. 
One writes p — q = v, defining what is meant by '— ' between two elements of Aff(V). 
Considered as Abelian groups, any linear subspace W C V defines a subgroup. The 
orbit of that subgroup in Aff(V) through m G Aff(V) is an affine subspace, denoted 
by Wtt^, i.e. 

WTT^ = m + W:={m + w|wG W}, (181) 

which is an affine space over W in its own right of dimension dim(W). One- 
dimensional affine subspaces are called (straight) lines, two-dimensional ones planes, 
and those of co-dimension one are called hyperplanes. 

A.8 Affine frames 

A basis for Aff(V) is a tuple F := (m, f ), where ra is a point in Aff(V) and f a basis 
of V. F can be considered as a map F^ — > Aff(V), given by F(x) := f (x) + m (here f 
is interpreted as linear frame). With this interpretation F is called an affine frame. We 
denote the set of affine frames by J^xn[Y] ■ 
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The general affine group of F"- is given in the familiar fashion by the semi-direct 
product X GL(F"^), which acts on F^ in the standard way: (j) : ((a, A),x) 
4)((a, A),x) := A(x) + a. Its multiphcation law is given by: 

(ai,Ai)(a2,A2) = (ai + Aiaa , A1A2) . (182) 

Depending on the choice of a frame F G ^Aff(V) the action 4> of F"^ x GL(F"^) on F"- 
translates to an action of xi GL(F^) on Aff(V) as follows: (t)^{(a, A),p) := 
F o 4)((a, A), F^^ (p)); in other words, if F = (ra, f) and F(x) = p, we have (^^ : 
((q,A),p) H^F(Ax + a) =Af(p-m) + m + f(a). 

F"^ X GL(F"^] has an obvious right action on ^Aff(V)' given by (g, F) 1— > F • g := 
Fog. Explicitly, for g = (a, A) and F = (m, f ), this reads 

F- g = (m,f) • (a,A) = (m + f(a),f o A) . (183) 

A.9 Lie algebras for matrix groups 
A.9.1 General considerations 

We first recall the definition of a Lie algebra: 

Detinition 8. A Lie algebra over F (here denoting M or C) is a vector space L over F 
endowed with a map (called the 'Lie bracket') L x L ^ L, (X, Y) [X, Y], which for 
allX,Y,Z G Lobeys: 

[X , Y] = - [Y X] (anti-symmetry) , ( 1 84a) 

[aX + Y Z] = a[X, Z] + [Y Z] (linearity) , (184b) 

[X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = (Jacobi identity) . (184c) 

A Lie subalgebra V C L is a linear subspace which becomes a Lie algebra when 
the bracket is restricted to L', i.e. if [L', L'] C L'. A Lie subalgebra is called an ideal 
if the stronger condition holds that [L', L] C L'. It is easy to see that if L' is an ideal 
the quotient L/L' is again a Lie algebra: just define the bracket of two cosets as the 
coset of the bracket of two arbitrary representatives, which is well defined. 

In may cases of interest L is already given as an associative algebra and the Lie 
bracket is then defined as commutator: [X, Y] := X • Y — Y • X. This is e.g. the case if 
L C End(V) since, as already mentioned, the endomorphisms of a vector space V form 
an associative algebra if the multiplication is taken to be the composition of maps. 

Given a matrix group G C GL(n,F] we consider the set C](M,G) of all contin- 
uously differentiable curves A : M — > G such that A(0) = In (unit n x n-matrix). 
We define A := ■^A(s)|s=05 the 'velocity' of the curve A(s) at the group identity. We 
consider the set of all such velocities: 

Lie(G) := {A I A G Cl(M,G)} C End(M^) . (185) 

Proposition 22. Lie(G) is a real Lie algebra. 

Proof. First we prove that Lie(G) is a linear space: Let X, Y G Lie(G) and A,B G 
C](M,G)) such that X = A and Y = B. Define C G C^IM.G) by C(s) := A(s) • 
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B(ks), where k e M, then C = X + kY, showing that Lie(G) is a vector space over M. 
Here and below ' • ' denotes matrix multiphcation. Now, since Lie(G) C End(F"^), i.e. 
lies in an associative algebra, we define the Lie bracket on Lie(G) as commutator, that 
is [X, Y] := X • Y — X • X. This bracket clearly satisfies conditions (I184t ). But we still 
have to show that [X, Y] is in Lie(G) if X, Y are. That is, we have to show that there is 
a curve C e C] (M, G) such that C = [X, Y]. To do this, let again A, B e C] (M, G) be 
such that X = A and Y = B. Then the sought for C is given by 



Cfsl 



'a(t(s)) • B(t(s)) • A-^(t(s)) • B-Vt(s)) for s > 0, 
B(t(s)) • A(t(s)) • B-1(t(s)) • A-Vt(s)) for s < 0, 



(186) 



where 



T s := 



y/s for s > , 
— f or s < . 



(187) 



This curve is indeed differentiable at s = (though s i— > A[^/s) and s i— > B[y/s) are 
not). Its right derivative (s > 0) is: 



C 



lim 

s->0 S 



lim 

s^O 



[A(t(s)),B(t(s))]A-1(t(s))B-1(t(s)) 



lim 



A(T) 



BM -1, 



[X,Y]. (188) 



Its left derivative follows along the same lines, one just exchanges A f-> B and replaces 
s with — s, leading again to [X, Y]. □ 



A.9.2 Some special Lie algebras 

Before we restrict attention to the Lorentz group and its inhomogeneous counterpart 
(sometimes called the Poincare group), let us describe in general the situation of which 
they are special cases. 

Consider a vector space V of n dimensions over the field F (M or C). As before, 
End(V] denotes the associative algebra of linear maps V h-) V. Let GL(V) C End(V) 
denote the set of invertible linear maps, i.e. det(f) 7^ (compare (11991 )) for all f G 
GL(V). 

Given a subgroup G C GL(V), there is a corresponding inhomogeneous group, 
IG C IGL(V), given by the semi-direct product of V (considered as Abelian group 
under addition) with G, denoted by V xi G. Its multiplication law is as follows: 

(ai,Ai](a2,A2) = (ai +Ai(a2], Ai 0A2], a^eVA^eG (189) 

We endow V with a non-degenerate bilinear form tu : V x V — > F, which 
we restrict to be either symmetric (e = 1) or antisymmetric (e = —1), that is 
tu(v, w) = e tu(w,v) for all v, w G V. We want to consider the group G C GL(V) of 
tu preserving maps: 

G := {A e GL(V) I a;(Av,Aw) = g(v,w) Vv,weV}. (190a) 
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Using the 'index-lowering' map cu^ : V — > V*, v h-> cu(v, •] and its inverse tu^- : V* — > 
V, the 'index-raising' map (cf. Section lAT2t . this can also be written as 

G := {A G GL(V) | tu^oAocu^ = (A^)"^}. (190b) 

The Lie algebra Lie(G) is easily obtained by considering curves in G, as explained 
in the previous subsection. Using ( 11901 ) this leads to 

Lie(G) : = {X G End(V) I tu(Xv,w) + a)(v,Xw) =0 Vv,wgV}, (191a) 
= {X G End(V) I tu^ o X o cu^ = -X^} . (191b) 

Let us describe it more concretely in terms of components. Choose a basis {eaja^i.- TL 
of V and the corresponding dual basis {r\^]a=^■■■n of V*, so that r|"(eb) = 
From (I191bl) it follows that a general element X^Ca ^ r]^ G End(V) lies in Lie(G) 
iff Xab = — eXba, where X^^ := X^tUca- Hence, writing r\a '■= tUa^ri^ so that 
ila(eb) = <J^ah (cf. Sect. IA.5t . a basis for Lie(G) is given by the jn[n — e) vectors 

Mab = ea^rib - eeb 0ria. (192) 

The Lie algebra of the corresponding inhomogeneous group is given by the linear 
space V © Lie(G) and Lie bracket as follows: 

[(ai,Xi), (a2,X2)] = (Xi(a2)-X2(ai), [Xi.Xj). (193) 

Hence we obtain Lie(IG) by adding to ( 11921 ) the n translation generators 

Ta:=ea. (194) 

Together they span the ^^(n + 2 — e) -dimensional Lie algebra Lie(IG], whose com- 
mutation relations easily follow from ( I192ll93ll94t : 

ad~ £'J^ac'^bd~ £ '^bd^ac > (195a) 
[Mab.Tc] = OJbcTa- eCUacTb, (195b) 

[Ta,Tb] = 0. (195c) 
Two special cases of this general setting become relevant in SR: 

1 Let V = M"^ and cu symmetric with signature (1,3) (one plus, three minuses). 
The technical name of G is then 0(1 ,3). Generally, if V = M"^ and if cu is of 
signature (p,q), where p + q = n, G is called 0(p,q). 0(p,q) is isomorphic to 
0(q ,p) and O(rL,0) is just the ordinary orthogonal group 0(n) in n dimensions. 

2 Let V = and cu antisymmetric. In two dimensions, leaving an antisymmetric 
form invariant is equivalent to having unit determinant. Hence G = SL(2,C), 
the group of complex 2x2 matrices of unit determinant. The group SL(2,C) is 
the double (and also universal) cover of the identity component of 0(1 ,3), often 
denoted by 0^(1 ,3) or 80^(1 ,3). 
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A.IO Exponential map 

Since End(V) form an associative algebra, we can form functions based on addition 
and multiplication. Writing X"^ for the n-fold composition X o • • • o X, we can define 
the exponential map 

°° yn 

exp : End(V) -> End(V) , exp(X) := V — . (196) 

^ — n! 

n=0 

Note that the series converges absolutely with respect to the standard norms on 
End(V]. 

Now consider 'det' and 'trace', which are the familiar F- valued functions on 
End(V): 

det(X) := det^{Ti"(Aeb)), (197) 
trace(X) := ^''(Aea) , (198) 

where {Bq} and {r| ^} is any pair of dual bases (it does not matter which one) and where 
detra is the standard determinant function for matrices. We have 

Proposition 23. 

det o exp = exp o trace . (199) 

Proof. Assume V to be complex (complexify if V was real). For X G End(V) one 
can then find an eigenbasis, so that with respect to it X is a triangular matrix, whose 
diagonal entries are its eigenvalues. Then equation ( I199t reduces to the statement, 
that the product of the exponentials of the eigenvalues is the exponential of their sum, 
which is true of course. □ 

Equation ( I199t shows that det(exp(X)) > for any X G End(V). Moreover, any 
element A = exp(X) is connected to the identity by a continuous path s h-) exp(sX). 
Hence the image of End(V) under exp is contained in the identity component of 
GL(V), which is given by the invertible linear maps of positive determinant, denoted 
by GL+(V). 

Note that the curve s i— > exp(sX) is a homomorphism from the additive group M 
to GL(V). Conversely, we have 

Proposition 24. Let y : M — ) GL(V) be a homomorphism, i.e. a map that satisfies 

y(0)=l and y(s + t) = y(s) o y(t) for all s,t eM.. (200) 
Then y must be of the form y(s) = exp(sX), where X = y (0). 

Proof. We consider the curve (3(s) := y(s) o exp(— sX), which satisfies (3(0) = 1 
and (3(s) = y(s) — y(s) o X. But this is zero, as can be seen from differentiating 
y(s + t) = y(s) o y(t) with respect to t at t = 0. Hence (3(s) =1 for all s, showing 
thaty(s) = exp(sX). □ 
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Let us now regard the exponential map restricted to the special Lie subalgebras 
Lie(G) defined in ( I191I ). Since 

cu^ o exp(X) o tu^ = exp(cu-^ o X o w^) (201) 

for all X G End(X), the image of Lie(G) under the exponential map lies in G. More 
precisely, since Lie(G) is connected and exp continuous, the image must also be con- 
nected. Since it also contains the identity (1 = exp(O)), the image of Lie(G) lies in the 
identity component of G, denoted by Gi. Hence we have a map 

exp :Lie(G) ^ Gi. (202) 

It is clear that this map is generally not injective. Consider e.g. the group S0(2) of 
planar rotations, which is topologically a circle (S ^ ) and whose Lie algebra is the real 
line, exp winds the line infinitely often around the circle. But neither is exp generally 
onto. A relevant example is given by G = SL(2, C). Its Lie algebra is given by the 
space of traceless 2x matrices with complex entries. Now, for example, none of the 
matrices 

Aa:=(^~Q ^-^^ a/0 (203) 

can be in the image of the exponential map. To see this, first note that, within SL(2, C), 
A(a] can be continuously connected to the identity, e.g. by the path 

Aa(s) = f^^P(;"^^ y (204) 

\ exp(— iTTsJ J 

In fact, SL(2,C) is connected. Suppose now that exp(X) = Aq for some traceless 
X. The eigenvalues of X are itA 7^ so that X is diagonalizable. Let T G GL(2, C) 
such that TXT^^ = diag(A,— A), then TAqT^^ = diag(exp(A),exp(— A)), which is 
impossible since both eigenvalues of Aq equal —1 . 

What is however true is that the image of the exponential map covers a neighbor- 
hood of the group identity. This follows from the fact that the derivative of the smooth 
map (I202t evaluated at 1 G G is non-zero (it is the identity map Lie(G) — ) Lie(G]). 
Hence, by the inverse-function theorem, it has a local smooth inverse. Moreover, we 
have the following 

Proposition 25. Any A £ G is the finite product of elements in the image ofexp, that 
is, for any A G G there exist X| G Lie(G], i = 1 , • • • , k < 00, such that 

A = exp(Xi) o • • • o exp(Xk) . (205) 

Proof. We first note that elements of the form (1205 1 obviously form a subgroup G ' C 
Gi which contains a whole neighborhood U C 1 G Gi, as we have just seen. Now, 
for any A G G', the map La : G' — ) G', B h-) AB, is a smooth bijection with smooth 
inverse L^-i ■ Hence La is an open map (sends open sets to open sets) so that La(U) 
is an open neighborhood of A G Gi. This shows that G' C Gi is open. Likewise 
one shows that all cosets of G' in Gi are open, since they are obtained as images of 
G' under La for some A G Gi. But this shows that G' C Gi is also closed, since it 
is the complement of the union of all G '-cosets different from G' itself. Being open 
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and closed in the connected set Gi, G' is necessarily identical to it. [This argument 
shows in fact that any neighborhood U of the identity in a topological group generates 
the identity component, in the sense that any element in the identity component is the 
finite product of elements from U.] □ 



Acknowledgements 

I sincerely thank Jurgen Ehlers and Claus Lammerzahl for inviting me to the 2005 Pots- 
dam conference on Special Relativity and also giving me the opportunity to present this 
material in written form. 1 am also grateful to Matteo Carrera and Abraham Ungar for 
remarks and suggestions which led to various improvements. 



References 

[1] Alexander Danilovich Alexandrov. Mappings of spaces with families of cones 
and space-time transformations. Annali di Matematica (Bologna), 103(8):229- 
257, 1975. 

[2] James L. Anderson. Principles of Relativity Physics. Academic Press, New York, 
1967. 

[3] Henri Bacry and Jean-Marc Levy-Leblond. Possible kinematics. Journal of 
Mathematical Physics, 9(10): 1605-1614, 1968. 

[4] Frank Beckman and Donald Quarles. On isometrics of euclidean spaces. Pro- 
ceedings of the American Mathematical Society, 4:810-815, 1953. 

[5] Enrico Beltrametti and Gianni Cassinelli. The Logic of Quantum Mechanics. En- 
cyclopedia of Mathematics and its Application Vol. 15. Addison-Wesley, Read- 
ing, Massachusetts, 1981. 

[6] Marcel Berger. Geometry, volume I. Springer Verlag, Berlin, first edition, 1987. 
Corrected second printing 1994. 

[7] Marcel Berger. Geometry, volume II. Springer Verlag, Berlin, first edition, 1987. 
Corrected second printing 1996. 

[8] Vittorio Berzi and Vittorio Gorini. Reciprocity principle and the Lorentz trans- 
formations. Journal of Mathematical Physics, 10(8): 15 18-1524, 1969. 

[9] David Bleecker. Gauge Theory and Variational Principles. Number 1 in Global 
Analysis, Pure and Applied. Addison-Wesley, Reading, Massachusetts, 1981. 

[10] Hans-Jurgen Borchers and Gerhard Hegerfeld. The structure of space-time trans- 
formations. Communications in Mathematical Physics, 28:259-266, 1972. 

[11] Emile Borel. La theorie de la relativite et la cinematique. In (Euvres de Emile 
Borel, volume 3, pages 1809-181 1. Editions du Centre National de la Recherche 
Scientifique, Paris, 1972. First appeared in Comptes Rendus des seances de 
I'Academie des Sciences 156 (1913): 215-217. 



63 



[12] Max Born. Die Theorie des starren Elektrons in der Kinematik des Rela- 
tivitatsprinzips. Annalen der Physik (Leipzig), 30:1-56, 1909. 

[13] Horacio Casini. The logic of causally closed spacetime subsets. Classical and 
Quantum Gravity, 19:6389-6404, 2002. 

[14] Wojciech Cegla and Arkadiusz Jadczyk. Logics generated by causality structures, 
covariant representations of the galilei group. Reports on Mathematical Physics, 
9(3):377-385, 1976. 

[15] Wojciech Cegla and Arkadiusz Jadczyk. Causal logic of Minkowski space. Com- 
munications in Mathematical Physics, 57:213-217, 1977. 

[16] Alexander Chubarev and losif Pinelis. Linearity of space-time transformations 
without the one-to-one, line-onto-line, or constancy-of-speed-of-light assump- 
tion. Communications in Mathematical Physics, 215:433-441, 2000. 

[17] Paul Ehrenfest. Gleichformige Rotation starrer Korper und Relativitatstheorie. 
Physikalische Zeitschrift, 10(23):918, 1909. 

[18] Vladimir Fock. The Theory of Space Time and Gravitation. Pergamon Press, 
London, first english edition, 1959. 

[19] Philipp Frank and Hermann Rothe. Uber die Transformation der Raumzeitko- 
ordinaten von ruhenden auf bewegte Systeme. Annalen der Physik (Leipzig), 
34(5):825-855, 1911. 

[20] Philipp Frank and Hermann Rothe. Zur Herleitung der Lorentztransformation. 
Physikalische Zeitschrift, 13:750-753, 1912. Erratum: ibid, p. 839. 

[21] Wilhelm I. Fushchich, Vladimir M. Shtelen, and N.I. Serov. Symmetry Analysis 
and Exact Solutions of Equations of Nonlinear Mathematical Physics. Kluwer 
Academic Publishers, Dordrecht, 1993. 

[22] Domenico Giulini. Advanced Special Relativity. Oxford University Press, Ox- 
ford. To appear. 

[23] Domenico Giulini. Some remarks on the notions of general covariance and back- 
ground independence. ArXiv gr-qc/0603087 

[24] Domenico Giulini. On Galilei invariance in quantum mechanics and the 
Bargmann superselection rule. Annals of Physics (New York), 249(l):222-235, 
1996. 

[25] Domenico Giulini. Uniqueness of simultaneity. Britisch Journal for the Philos- 
ophy of Science, 52:651-670, 2001. 

[26] Domenico Giulini. Das Problem der Tragheit. Philosophia Naturalis, 39(2): 843- 
374, 2002. 

[27] Domenico Giulini and Norbert Straumann. Einstein's impact on the physics of 
the twentieth century. Studies in the History and Philisophy of Modem Physics, 
37:115-173, 2006. 



64 



[28] Rudolf Haag. Local Quantum Physics: Fields, Particles, Algebras. Texts and 
Monographs in Physics. Springer Verlag, Berlin, first 1991, second revised and 
enlarged 1996 edition, 1996. 

[29] Gerhard Hegerfeld. The Lorentz transformations: Derivation of linearity and 
scale factor. IlNuovo Cimento, 10 A(2):257-267, 1972. 

[30] Gustav Herglotz. Uber den vom Standpunkt des Relativitatsprinzips aus als 
'starr' zu bezeichnenden Korper. Annalen der Physik (Leipzig), 31:393^15, 
1910. 

[31] Wladimir von Ignatowsky. Einige allgemeine Bemerkungen zum Rela- 
tivitatsprinzip. Verhandlungen der Deutschen Physikalischen Gesellschaft, 
12:788-796, 1910. 

[32] Erdal Inonii and Eugen Wigner. On the cotraction of groups and their represen- 
tations. Proceedings of the National Academy of Sciences, 39(6):5 10-524, 1953. 

[33] Nathan Jacobson. Basic Algebra L W.H. Freeman and Co., New York, second 
edition, 1985. 

[34] Josef M. Jauch. Foundations of Quantum Mechanics. Addison-Wesley, Reading, 
Massachusetts, 1968. 

[35] Theodor Kaluza. Zur Relativitatstheorie. Physikalische Zeitschrift, 11:977-978, 
1910. 

[36] FeUx Klein. Vergleichende Betrachtungen Uber neuere geometrische Forschun- 
gen. Verlag von Andreas Deichert, Erlangen, first edition, 1872. Reprinted in 
Mathematische Annalen (Leipzig) 43 (1892) 43-100. 

[37] Max von Laue. Zur Diskussion fiber den starren Korper in der Relativitatstheorie. 
Physikalische Zeitschrift, 12:85-87, 1911. 

[38] Fritz Noether. Zur Kinematik des starren Korpers in der Relativitatstheorie. An- 
nalen der Physik (Leipzig), 31:919-944, 1910. 

[39] FeUx Pirani and Gareth Williams. Rigid motion in a gravitational field. Seminaire 
JANET (Mecanique analytique et Mecanique celeste), 5e annee(8):l-16, 1962. 

[40] John A. Rhodes and Mark D. Semon. Relativistic velocity space, Wigner rotation, 
and Thomas precession. American Journal of Physics, 72(7):943-956, 2004. 

[41] Alfred A. Robb. Optical Geometry of Motion: A New View of the Theory of 
Relativity. W. Heffer & Sons Ltd., Cambridge, 1911. 

[42] Roman U. Sexl and Hehnuth K. Urbantke. Relativity, Groups, Particles. Springer 
Verlag, Wien, first edition, 2001. First engUsh edition, succeeding the 1992 third 
revised german edition. 

[43] Arnold Sommerfeld. Uber die Zusammensetzung der Geschwindigkeiten in der 
Relativtheorie. Physikalische Zeitschrift, 10:826-829, 1909. 



65 



[44] Llewellyn Hilleth Thomas. The kinematics of an electron with an axis. Philo- 
sophical Magazine, 3:1-22, 1927. 

[45] Andrzej Trautman. Foundations and current problems of general relativity. In 
A. Trautman, F.A.E. Pirani, and H. Bondi, editors. Lectures on General Relativ- 
ity, volume 1 of Brandeis Summer Institute in Theoretical Physics, pages 1-248. 
Prentice-Hall, Inc., Englewood Cliffs, New Jersey, 1964. 

[46] Abraham Ungar. Thomas rotation and the parametrization of the Lorentz trans- 
formation group. Foundations of Physics Letters, l(l):57-89, 1988. 

[47] Abraham Ungar. Beyond Einstein's Velocity Addition Law, volume 117 of Fun- 
damental Theories of Physics. Kluwer Academic, Dordrecht, 2001. 

[48] Abraham Ungar. Analytic Hyperbolic Geometry: Mathematical Foundations and 
Applications. World Scientific, Singapore, 2005. 

[49] Helmuth Urbantke. Physical holonomy: Thomas precession, and Clifford alge- 
bra. American Journal of Physics, 58(8):747-750, 1990. Erratum ibid. 59(12), 
1991, 1150-1151. 

[50] Helmuth Urbantke. Lorentz transformations from reflections: Some applications. 
Foundations of Physics Letters, 16:111-117, 2003. ArXiv math-ph/0212038 

[51] Vladimir Varicak. Anwendung der Lobatschefskijschen Geometric in der Rela- 
tivtheorie. Physikalische Zeitschrift, 11:93-96, 1910. 

[52] Vladimir Varicak. Uber die nichteuklidische Interpretation der Relativtheorie. 
Jahresberichte der Deutschen Mathematikervereinigung (Leipzig), 21:103-127, 
1912. 

[53] Erik Christopher Zeeman. Causality implies the Lorentz group. Journal of Math- 
ematical Physics, 5(4):490^93, 1964. 



66 



